Abstract

This paper is concerned with a leader-follower stochastic differential game with asymmetric information, where the information available to the follower is based on some sub-$\sigma$-algebra of that available to the leader. This kind of game problem has wide applications in finance, economics and management engineering, such as newsvendor problems, cooperative advertising and pricing problems. Stochastic maximum principles and verification theorems with partial information are obtained to represent the Stackelberg equilibrium. As an application, a linear-quadratic leader-follower stochastic differential game with asymmetric information is studied. It is shown that the open-loop Stackelberg equilibrium admits a state feedback representation if some system of Riccati equations is solvable.
1 Introduction

Throughout this paper, we denote by $\mathbb{R}^n$ the Euclidean space of $n$-dimensional vectors, by $\mathbb{R}^{n\times d}$ the space of $n \times d$ matrices, and by $\mathcal{S}^n$ the space of $n \times n$ symmetric matrices. $\langle\cdot,\cdot\rangle$ and $|\cdot|$ denote the scalar product and norm in the Euclidean space, respectively. $\top$ appearing in the superscripts denotes the transpose of a matrix. $f_x, f_{xx}$ denote the first and second partial derivatives with respect to $x$ of a differentiable function $f$.

1.1 Motivation 

First, we present two examples which motivate us to 
study leader-follower stochastic differential games with 
asymmetric information in this paper. 

Example 1.1: (Continuous Time Newsvendor Problem) Let $D(\cdot)$ be the demand rate for some product in a market, which satisfies the stochastic differential equation (SDE)

$$
\begin{cases}
dD(t) = a(\mu - D(t))\,dt + \sigma\,dW(t) + \tilde{\sigma}\,d\widetilde{W}(t),\\
D(0) = d_0 \in \mathbb{R},
\end{cases}
$$

where $a, \mu, \sigma, \tilde{\sigma}$ are constants. We consider a market consisting of a manufacturer selling the product to end users through a retailer. At time $t$, the retailer chooses an order rate $q(t)$ for the product and decides its retail price $R(t)$, and is offered a wholesale price $w(t)$ by the manufacturer. We assume that items can be salvaged at unit price $S > 0$, and that items cannot be stored, i.e., they must be sold instantly or salvaged.
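To make the demand dynamics concrete, the following is a minimal Python sketch that simulates one path of a mean-reverting demand rate driven by two independent Brownian motions, using the Euler-Maruyama scheme. The function name, the step count and the parameter values are illustrative assumptions and are not taken from the paper.

```python
import math
import random


def simulate_demand(d0, a, mu, sigma, sigma_tilde, T=1.0, steps=1000, seed=0):
    """Euler-Maruyama path of dD = a(mu - D)dt + sigma dW + sigma_tilde dW~.

    All names and default values are hypothetical; W and W~ are simulated
    as independent Brownian motions.
    """
    rng = random.Random(seed)
    dt = T / steps
    sqrt_dt = math.sqrt(dt)
    path = [d0]
    for _ in range(steps):
        d = path[-1]
        dW = rng.gauss(0.0, sqrt_dt)        # increment of W
        dW_tilde = rng.gauss(0.0, sqrt_dt)  # increment of the second noise
        path.append(d + a * (mu - d) * dt + sigma * dW + sigma_tilde * dW_tilde)
    return path
```

With both noise intensities set to zero, the path decays towards the long-run level `mu`, which is a quick sanity check on the drift term.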

The retailer will obtain an expected profit

$$
J_1(q(\cdot), R(\cdot), w(\cdot)) = \mathbb{E}\int_0^T \big[(R(t) - S)\min[D(t), q(t)] - (w(t) - S)q(t)\big]\,dt.
$$

When the manufacturer has a fixed production cost per unit $M > 0$, he will get an expected profit

$$
J_2(q(\cdot), R(\cdot), w(\cdot)) = \mathbb{E}\int_0^T (w(t) - M)q(t)\,dt.
$$
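The two profit integrals can be approximated along a single discretized path by a Riemann sum, as in the hedged Python sketch below. The function name and the discretization are assumptions made for illustration; the expectations in the text would be estimated by averaging this quantity over many simulated demand paths.

```python
def newsvendor_profits(demand, q, R, w, S, M, dt):
    """Riemann-sum approximation of the retailer and manufacturer profits.

    demand, q, R, w are equally long sequences sampled on a grid of mesh dt
    (hypothetical inputs); S is the salvage price and M the unit cost.
    """
    j1 = 0.0  # retailer
    j2 = 0.0  # manufacturer
    for d_t, q_t, r_t, w_t in zip(demand, q, R, w):
        sold = min(d_t, q_t)  # sales cannot exceed demand or the order rate
        j1 += ((r_t - S) * sold - (w_t - S) * q_t) * dt
        j2 += (w_t - M) * q_t * dt
    return j1, j2
```

For a constant demand above the order rate, all ordered items are sold, so the retailer's integrand reduces to $(R - w)q$.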


Let $\mathcal{F}_t$ denote the $\sigma$-algebra generated by the Brownian motions $W(s), \widetilde{W}(s)$, $0 \le s \le t$. Intuitively, $\mathcal{F}_t$ contains all the information up to time $t$. We assume that the information $\mathcal{G}_{1,t}, \mathcal{G}_{2,t}$ available to the retailer and the manufacturer at time $t$, respectively, are both sub-$\sigma$-algebras of $\mathcal{F}_t$. Moreover, the information available to them at time $t$ is asymmetric and $\mathcal{G}_{1,t} \subseteq \mathcal{G}_{2,t}$. This can be explained from the viewpoint of practical applications. Specifically, the manufacturer chooses a wholesale price $w(t)$ at time $t$, which is a $\mathcal{G}_{2,t}$-adapted stochastic process. The retailer chooses an order rate $q(t)$ and a retail price $R(t)$ at time $t$, which are $\mathcal{G}_{1,t}$-adapted stochastic processes. For any $w(\cdot)$, selecting a $\mathcal{G}_{1,t}$-adapted process pair $(q^*(\cdot), R^*(\cdot))$ for the retailer such that

$$
\begin{aligned}
J_1(q^*(\cdot), R^*(\cdot), w(\cdot)) &= J_1(q^*(\cdot; w(\cdot)), R^*(\cdot; w(\cdot)), w(\cdot))\\
&= \max_{q(\cdot), R(\cdot)} J_1(q(\cdot), R(\cdot), w(\cdot)),
\end{aligned}
$$

and then selecting a $\mathcal{G}_{2,t}$-adapted process $w^*(\cdot)$ for the manufacturer such that

$$
\begin{aligned}
J_2(q^*(\cdot), R^*(\cdot), w^*(\cdot)) &= J_2(q^*(\cdot; w^*(\cdot)), R^*(\cdot; w^*(\cdot)), w^*(\cdot))\\
&= \max_{w(\cdot)} J_2(q^*(\cdot; w(\cdot)), R^*(\cdot; w(\cdot)), w(\cdot)),
\end{aligned}
$$

formulates a leader-follower stochastic differential game with asymmetric information. In this setting, the retailer is the follower and the manufacturer is the leader. Any process triple $(q^*(\cdot), R^*(\cdot), w^*(\cdot))$ satisfying the above two equalities is called an open-loop Stackelberg equilibrium. In Øksendal et al. [15], a time-dependent newsvendor problem with time-delayed information is solved, based on a stochastic differential game (with jump-diffusion) approach. However, it cannot cover our model.

Example 1.2: (Cooperative Advertising and Pricing Problem) In supply chain management of the market, there are usually two members, the manufacturer and the retailer. Cooperative advertising is an important instrument for aligning manufacturer and retailer decisions. Specifically, we introduce the following SDE, which is a generalization of Sethi's stochastic sales-advertising model introduced by He et al. [8]:

$$
\begin{cases}
dx(t) = \big[\rho u(t)\sqrt{1 - x(t)} - \delta x(t)\big]\,dt + \sigma(x(t))\,dW(t) + \tilde{\sigma}(x(t))\,d\widetilde{W}(t),\\
x(0) = x_0 \in [0,1],
\end{cases}
$$

where $x(t)$ represents the awareness share, i.e., the number of aware (or informed) customers expressed as a fraction of the total market at time $t$, $\rho$ is a response constant, and $\delta$ determines the rate at which potential consumers are lost. $\sigma(x), \tilde{\sigma}(x)$ are functions satisfying the usual conditions.

At time $t \ge 0$, the manufacturer decides on the wholesale price $w(t)$ and the cooperative participation rate $\theta(t)$, and the retailer decides the channel's total advertising effort level $u(t)$ and the retail price $p(t)$. The sequence of events is as follows. At time $t$, first, the manufacturer announces his wholesale price $w(t)$ and participation rate $\theta(t)$. Second, the retailer sets his retail price $p(t)$ and advertising effort rate $u(t)$ as his optimal response to the manufacturer's announced decisions. The retailer accomplishes this by solving an optimization problem to maximize his expected profit

$$
J_1(w(\cdot), \theta(\cdot), u(\cdot), p(\cdot)) = \mathbb{E}\int_0^\infty e^{-rt}\big[(p(t) - w(t))D(p(t))x(t) - (1 - \theta(t))u^2(t)\big]\,dt,
$$

where $r > 0$ is the discount rate and $0 \le D(p) \le 1$ is the demand function satisfying the usual conditions. The manufacturer anticipates the retailer's reaction functions and incorporates them into his optimal control problem, and solves for his wholesale price policy $w(t)$ and the participation rate policy $\theta(t)$ at time $t$. Therefore, the manufacturer's optimization problem is to maximize his expected profit

$$
J_2(w(\cdot), \theta(\cdot), u(\cdot), p(\cdot)) = \mathbb{E}\int_0^\infty e^{-rt}\big[(w(t) - c)D(p(t))x(t) - \theta(t)u^2(t)\big]\,dt,
$$

where $c \ge 0$ is the constant unit production cost.

Define $\mathcal{F}_t := \sigma\{W(s), 0 \le s \le t\} \vee \sigma\{\widetilde{W}(s), 0 \le s \le t\}$. In the game setting, we assume that the information $\mathcal{G}_{1,t}, \mathcal{G}_{2,t}$ available to the retailer and the manufacturer at time $t$, respectively, are both sub-$\sigma$-algebras of the complete information filtration $\mathcal{F}_t$. Moreover, the information available to them at time $t$ is asymmetric and $\mathcal{G}_{1,t} \subseteq \mathcal{G}_{2,t}$. In detail, the wholesale price $w(\cdot)$ and the participation rate $\theta(\cdot)$ of the manufacturer are $\mathcal{G}_{2,t}$-adapted processes. For the retailer, his advertising effort level $u(\cdot)$ and retail price $p(\cdot)$ are $\mathcal{G}_{1,t}$-adapted processes. For any $(w(\cdot), \theta(\cdot))$, first selecting a suitable process pair $(u^*(\cdot), p^*(\cdot))$ for the retailer such that

$$
\begin{aligned}
J_1(w(\cdot), \theta(\cdot), u^*(\cdot), p^*(\cdot)) &= J_1(w(\cdot), \theta(\cdot), u^*(\cdot; w(\cdot), \theta(\cdot)), p^*(\cdot; w(\cdot), \theta(\cdot)))\\
&= \max_{u(\cdot),\, p(\cdot) \ge 0} J_1(w(\cdot), \theta(\cdot), u(\cdot), p(\cdot)),
\end{aligned}
$$

and then selecting a suitable process pair $(w^*(\cdot), \theta^*(\cdot))$ for the manufacturer such that

$$
J_2(w^*(\cdot), \theta^*(\cdot), u^*(\cdot), p^*(\cdot)) = \max_{w(\cdot),\, 0 \le \theta(\cdot) \le 1} J_2(w(\cdot), \theta(\cdot), u^*(\cdot; w(\cdot), \theta(\cdot)), p^*(\cdot; w(\cdot), \theta(\cdot))),
$$

formulates a leader-follower stochastic differential game with asymmetric information. See also [8] for more details about cooperative advertising and pricing models in dynamic stochastic supply chains, where a feedback Stackelberg equilibrium is obtained by applying the dynamic programming approach for stochastic differential games. However, asymmetric information was not considered there.

1.2 Problem Formulation

Inspired by the examples above, we study leader-follower stochastic differential games with asymmetric information in this paper.

Let $0 < T < \infty$ be a finite time duration and $(\Omega, \mathcal{F}, \mathbb{P})$ be a complete probability space. $(W(\cdot), \widetilde{W}(\cdot))$ is a standard $\mathbb{R}^{d_1 + d_2}$-valued Brownian motion. Let $\{\mathcal{F}_t\}_{0 \le t \le T}$ be the natural augmented filtration generated by $(W(\cdot), \widetilde{W}(\cdot))$ and $\mathcal{F}_T = \mathcal{F}$.

Suppose that the state of the system is described by the SDE

$$
\begin{cases}
dx^{u_1,u_2}(t) = b(t, x^{u_1,u_2}(t), u_1(t), u_2(t))\,dt + \sigma(t, x^{u_1,u_2}(t), u_1(t), u_2(t))\,dW(t)\\
\qquad\qquad\quad + \tilde{\sigma}(t, x^{u_1,u_2}(t), u_1(t), u_2(t))\,d\widetilde{W}(t),\\
x^{u_1,u_2}(0) = x_0,
\end{cases} \tag{1}
$$

where $u_1(\cdot)$ and $u_2(\cdot)$ are control processes taken by the two players in the game, labeled 1 (the follower) and 2 (the leader), with values in nonempty convex sets $U_1 \subseteq \mathbb{R}^{m_1}$, $U_2 \subseteq \mathbb{R}^{m_2}$, respectively. $x^{u_1,u_2}(\cdot)$, the solution to SDE (1) with values in $\mathbb{R}^n$, is the corresponding state process with initial state $x_0 \in \mathbb{R}^n$. Here $b(t,x,u_1,u_2) : \Omega \times [0,T] \times \mathbb{R}^n \times U_1 \times U_2 \to \mathbb{R}^n$, $\sigma(t,x,u_1,u_2) : \Omega \times [0,T] \times \mathbb{R}^n \times U_1 \times U_2 \to \mathbb{R}^{n \times d_1}$, $\tilde{\sigma}(t,x,u_1,u_2) : \Omega \times [0,T] \times \mathbb{R}^n \times U_1 \times U_2 \to \mathbb{R}^{n \times d_2}$ are given $\mathcal{F}_t$-adapted processes, for each $(x, u_1, u_2)$.

Let us now explain the asymmetric information feature between the follower (player 1) and the leader (player 2) in this paper. Player 1 is the follower, which means that the information available to him at time $t$ is based on some sub-$\sigma$-algebra $\mathcal{G}_{1,t} \subseteq \mathcal{G}_{2,t}$, where $\mathcal{G}_{2,t}$ is the information available to the leader at time $t$. We assume in this and the next sections that both $\mathcal{G}_{1,t}$ and $\mathcal{G}_{2,t}$ are sub-$\sigma$-algebras of the complete information filtration $\mathcal{F}_t$. That is, we have $\mathcal{G}_{1,t} \subseteq \mathcal{G}_{2,t} \subseteq \mathcal{F}_t$. We define the admissible control sets of the follower and the leader, respectively, as follows.

$$
\mathcal{U}_1 := \Big\{ u_1 \,\Big|\, u_1 : \Omega \times [0,T] \to U_1 \text{ is } \mathcal{G}_{1,t}\text{-adapted and } \sup_{0 \le t \le T} \mathbb{E}|u_1(t)|^i < \infty,\ i = 1, 2, \cdots \Big\}, \tag{2}
$$

$$
\mathcal{U}_2 := \Big\{ u_2 \,\Big|\, u_2 : \Omega \times [0,T] \to U_2 \text{ is } \mathcal{G}_{2,t}\text{-adapted and } \sup_{0 \le t \le T} \mathbb{E}|u_2(t)|^i < \infty,\ i = 1, 2, \cdots \Big\}. \tag{3}
$$

In the game problem, knowing that the leader has chosen $u_2(\cdot) \in \mathcal{U}_2$, the follower would like to choose a $\mathcal{G}_{1,t}$-adapted control $u_1^*(\cdot) = u_1^*(\cdot; u_2(\cdot))$ to minimize his cost functional

$$
J_1(u_1(\cdot), u_2(\cdot)) = \mathbb{E}\Big[\int_0^T g_1(t, x^{u_1,u_2}(t), u_1(t), u_2(t))\,dt + G_1(x^{u_1,u_2}(T))\Big]. \tag{4}
$$

Here $g_1(t,x,u_1,u_2) : \Omega \times [0,T] \times \mathbb{R}^n \times U_1 \times U_2 \to \mathbb{R}$ is an $\mathcal{F}_t$-adapted process, and $G_1(x) : \Omega \times \mathbb{R}^n \to \mathbb{R}$ is an $\mathcal{F}_T$-measurable random variable, for each $(x, u_1, u_2)$. Now the follower encounters a stochastic optimal control problem with partial information.

Problem of the follower. For any $u_2(\cdot) \in \mathcal{U}_2$ chosen by the leader, choose a $\mathcal{G}_{1,t}$-adapted control $u_1^*(\cdot) = u_1^*(\cdot; u_2(\cdot)) \in \mathcal{U}_1$, such that

$$
J_1(u_1^*(\cdot), u_2(\cdot)) = J_1(u_1^*(\cdot; u_2(\cdot)), u_2(\cdot)) = \inf_{u_1 \in \mathcal{U}_1} J_1(u_1(\cdot), u_2(\cdot)), \tag{5}
$$

subject to (1) and (4). Such a $u_1^*(\cdot) = u_1^*(\cdot; u_2(\cdot))$ is called a (partial information) optimal control, and the corresponding solution $x^{u_1^*,u_2}(\cdot)$ to (1) is called a (partial information) optimal state process for the follower.

In the following procedure of the game problem, once knowing that the follower would take such an optimal control $u_1^*(\cdot) = u_1^*(\cdot; u_2(\cdot))$, the leader would like to choose a $\mathcal{G}_{2,t}$-adapted control $u_2^*(\cdot)$ to minimize his cost functional

$$
J_2(u_1^*(\cdot), u_2(\cdot)) = \mathbb{E}\Big[\int_0^T g_2(t, x^{u_1^*,u_2}(t), u_1^*(t; u_2(t)), u_2(t))\,dt + G_2(x^{u_1^*,u_2}(T))\Big]. \tag{6}
$$

Here $g_2(t,x,u_1,u_2) : \Omega \times [0,T] \times \mathbb{R}^n \times U_1 \times U_2 \to \mathbb{R}$ is a given $\mathcal{F}_t$-adapted process and $G_2(x) : \Omega \times \mathbb{R}^n \to \mathbb{R}$ is a given $\mathcal{F}_T$-measurable random variable, for each $(x, u_1, u_2)$. Now the leader encounters a stochastic optimal control problem with partial information.

Problem of the leader. Find a $\mathcal{G}_{2,t}$-adapted control $u_2^*(\cdot) \in \mathcal{U}_2$, such that

$$
J_2(u_1^*(\cdot), u_2^*(\cdot)) = \inf_{u_2 \in \mathcal{U}_2} J_2(u_1^*(\cdot; u_2(\cdot)), u_2(\cdot)), \tag{7}
$$

subject to (1) and (6). Such a $u_2^*(\cdot)$ is called a (partial information) optimal control, and the corresponding solution $x^*(\cdot) = x^{u_1^*,u_2^*}(\cdot)$ to (1) is called a (partial information) optimal state process for the leader. We will restate the problem for the leader in more detail in the next section, since its precise description has to involve the solution to the Problem of the follower.

We refer to the problem mentioned above as a leader-follower stochastic differential game with asymmetric information. If there exists a control process pair $(u_1^*(\cdot), u_2^*(\cdot))$ satisfying (5) and (7), we refer to it as an open-loop Stackelberg equilibrium.

In this paper, we impose the following assumptions.

(A1.1) For each $\omega \in \Omega$, the functions $b, \sigma, \tilde{\sigma}, g_1$ are twice continuously differentiable with respect to $(x, u_1, u_2)$. For each $\omega \in \Omega$, the functions $g_2$ and $G_1, G_2$ are continuously differentiable with respect to $(x, u_1, u_2)$ and $x$, respectively. Moreover, for each $\omega \in \Omega$ and any $(t, x, u_1, u_2) \in [0,T] \times \mathbb{R}^n \times \mathbb{R}^{m_1} \times \mathbb{R}^{m_2}$, there exists $C > 0$ such that

$$
\begin{aligned}
&(1 + |x| + |u_1| + |u_2|)^{-1}|\phi(t,x,u_1,u_2)| + |\phi_x(t,x,u_1,u_2)| + |\phi_{u_1}(t,x,u_1,u_2)| + |\phi_{u_2}(t,x,u_1,u_2)|\\
&\quad + |\phi_{xx}(t,x,u_1,u_2)| + |\phi_{u_1u_1}(t,x,u_1,u_2)| + |\phi_{u_2u_2}(t,x,u_1,u_2)| \le C,
\end{aligned}
$$

for $\phi = b, \sigma, \tilde{\sigma}$, and

$$
(1 + |x|^2)^{-1}|G_1(x)| + (1 + |x|)^{-1}|G_{1x}(x)| + (1 + |x|^2)^{-1}|G_2(x)| + (1 + |x|)^{-1}|G_{2x}(x)| \le C,
$$

$$
\begin{aligned}
&(1 + |x|^2 + |u_1|^2 + |u_2|^2)^{-1}|g_1(t,x,u_1,u_2)|\\
&\quad + (1 + |x| + |u_1| + |u_2|)^{-1}\big(|g_{1x}(t,x,u_1,u_2)| + |g_{1u_1}(t,x,u_1,u_2)| + |g_{1u_2}(t,x,u_1,u_2)|\big)\\
&\quad + |g_{1xx}(t,x,u_1,u_2)| + |g_{1u_1u_1}(t,x,u_1,u_2)| + |g_{1u_2u_2}(t,x,u_1,u_2)| \le C,
\end{aligned}
$$

$$
\begin{aligned}
&(1 + |x|^2 + |u_1|^2 + |u_2|^2)^{-1}|g_2(t,x,u_1,u_2)|\\
&\quad + (1 + |x| + |u_1| + |u_2|)^{-1}\big(|g_{2x}(t,x,u_1,u_2)| + |g_{2u_1}(t,x,u_1,u_2)| + |g_{2u_2}(t,x,u_1,u_2)|\big) \le C.
\end{aligned}
$$

1.3 Literature Review and Contributions of This Paper

Initiated by Isaacs [12], differential games are useful in modeling dynamic systems where more than one decision maker is involved. Differential games have been investigated by many authors and have been found to be a useful tool in many applications, particularly in biology, economics and finance. Stochastic differential games are differential games for stochastic systems involving noise terms. See the monograph by Başar and Olsder [4] for more information about differential games. For some of the most recent developments in stochastic differential games and their applications, please refer to Yong [25], Hamadène [7], Wu [23], An and Øksendal [1], Buckdahn and Li [6], Wang and Yu [20,21], Yu [29], Hui and Xiao [10,11], Shi [17] and the references therein.


The leader-follower stochastic differential game is the dynamic and stochastic formulation of the well-known Stackelberg game, which was introduced by Stackelberg [18] in 1934, when he defined a concept of a hierarchical solution for markets where some firms have power of domination over others. This solution concept is now known as the Stackelberg equilibrium which, in the context of two-person nonzero-sum games, involves players with asymmetric roles, one leader and one follower. An early study of stochastic Stackelberg differential games can be found in Başar [3]. In detail, a leader-follower stochastic differential game (or stochastic Stackelberg differential game) proceeds with the follower aiming at minimizing his cost functional in accordance with the leader's strategy on the whole duration of the game. Anticipating the follower's optimal response depending on his entire strategy, the leader chooses an optimal one in advance to minimize his own cost functional, based on the stochastic Hamiltonian system satisfied by the follower's optimal response. The pair of the leader's optimal strategy and the follower's optimal response is known as the Stackelberg equilibrium.
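The two-stage structure described above (follower's best response first, then the leader's optimization over that response) can be illustrated on a static, deterministic toy game. The Python sketch below is only an illustration under assumed quadratic costs; the cost functions, the search interval and the ternary-search solver are hypothetical choices and are not part of the paper's stochastic model.

```python
def argmin_scalar(f, lo, hi, tol=1e-9):
    """Ternary search for the minimiser of a strictly convex function on [lo, hi]."""
    while hi - lo > tol:
        m1 = lo + (hi - lo) / 3.0
        m2 = hi - (hi - lo) / 3.0
        if f(m1) < f(m2):
            hi = m2
        else:
            lo = m1
    return 0.5 * (lo + hi)


def follower_cost(u1, u2):
    # Hypothetical follower cost: track the leader's action, penalise effort.
    return (u1 - u2) ** 2 + u1 ** 2


def leader_cost(u1, u2):
    # Hypothetical leader cost: stay near 1, penalise the follower's action.
    return (u2 - 1.0) ** 2 + u1 ** 2


def best_response(u2):
    """Follower's optimal u1 for a given leader action u2."""
    return argmin_scalar(lambda u1: follower_cost(u1, u2), -10.0, 10.0)


def stackelberg():
    """Leader minimises his cost along the follower's best-response curve."""
    u2_star = argmin_scalar(lambda u2: leader_cost(best_response(u2), u2), -10.0, 10.0)
    return best_response(u2_star), u2_star
```

For these assumed costs the best response is $u_1 = u_2/2$, so the leader minimizes $(u_2 - 1)^2 + u_2^2/4$, which gives $u_2^* = 0.8$ and $u_1^* = 0.4$; the numerical search should reproduce these values up to its tolerance.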
To the best of our knowledge, there are few papers on leader-follower stochastic differential games. A pioneering work was done by Yong [26], where a linear-quadratic (LQ) leader-follower stochastic differential game was introduced and studied. The coefficients of the system and the cost functionals are random, the controls enter the diffusion term of the state equation, and the weight matrices for the controls in the cost functionals are not necessarily positive definite. To give a state feedback representation of the open-loop Stackelberg equilibrium (in a non-anticipating way), the related Riccati equations are derived and sufficient conditions for the existence of their solution with deterministic coefficients are discussed. Bensoussan et al. [5] obtained the global maximum principles for both open-loop and closed-loop stochastic Stackelberg differential games, whereas the diffusion term does not contain the controls. The solvability of related Riccati equations is discussed, in order to obtain the state feedback Stackelberg equilibrium.

In this paper, we initiate the study of a leader-follower stochastic differential game with asymmetric information, which distinguishes itself from the literature mentioned above in the following aspects. (i) In our framework, both information filtrations available to the leader and the follower could be sub-$\sigma$-algebras of the complete information filtration naturally generated by the random noise source. Moreover, the information available to the follower is based on some sub-$\sigma$-algebra of that available to the leader. This gives a new explanation for the asymmetric information feature between the follower and the leader, and endows our problem formulation with more practical meaning. (ii) An important class of LQ leader-follower stochastic differential games with asymmetric information is first proposed and then solved. It consists of a stochastic optimal control problem of SDE with partial observation for the follower, followed by a stochastic optimal control problem of conditional mean-field forward-backward stochastic differential equation (FBSDE) with complete information for the leader. This problem is new in differential game theory and has considerable impact in both theoretical analysis and practical applications, although it has intrinsic mathematical difficulties. Note that in [26], the problems for both the follower and the leader are stochastic optimal control problems with complete information. (iii) The open-loop Stackelberg equilibrium of this LQ game problem with asymmetric information is characterized in terms of the forward-backward stochastic differential filtering equation (FBSDFE), which arises naturally in our setup. To the best of our knowledge, these FBSDFEs are new in both stochastic control and filtering theory. In particular, they are different from those in Zhang [30], Huang et al. [9] and Wang and Yu [21]. (iv) State feedback representations for the optimal controls of the follower and the leader are explicitly given with the help of some new Riccati equations. Both problems for the follower and the leader have mathematical difficulties, and we overcome them via the measure transformation and filtering technique in Xiong [24] and Wang et al. [19], the linear FBSDE decoupling technique in [26] and the mean-field FBSDE decoupling technique in Yong [27], respectively.

The rest of this paper is organized as follows. In Section 2, we formulate the problems of the follower and the leader, and then solve them in this order. In Subsection 2.1, we first solve the stochastic optimal control problem with partial information of the follower, applying the theory of controlled SDEs with partial information. The partial information maximum principle and verification theorem are shown. In Subsection 2.2, we then formulate the stochastic optimal control problem of conditional mean-field FBSDE with partial information of the leader, regarding the stochastic Hamiltonian system corresponding to the follower's optimal response as his state equation. Via controlled conditional mean-field FBSDEs, the maximum principle and verification theorem with partial information are both given. In Section 3, we apply our theoretical results to an LQ leader-follower stochastic differential game with asymmetric information. Specifically, Subsection 3.1 is devoted to the solution of an LQ stochastic optimal control problem with partial observation of the follower. The key technique is to get some observable optimal controls by explicitly computing the optimal filtering estimates of the corresponding adjoint processes, when applying the maximum principle approach and measure transformation. The state feedback form of the observable optimal control for the follower is then represented by some symmetric Riccati equation under appropriate assumptions. Here, the state of the follower is represented by some FBSDFE. Subsection 3.2 is devoted to the solution of an LQ stochastic optimal control problem of conditional mean-field FBSDE with complete information of the leader. The state feedback representation of the optimal control is derived. The difficulty here is that the filtering estimates of the original state and of the leader's control are involved in his state equation. We overcome this difficulty by applying the maximum principle and verification theorem for conditional mean-field FBSDEs, via the solution to some adjoint conditional mean-field FBSDE. The state feedback representation for the optimal control of the leader (together with the optimal control of the follower) is obtained via some new system of Riccati equations in the double dimensional spaces, when the dimensional augmentation by Yong [26] is applied. Thus, the state feedback representation of the open-loop Stackelberg equilibrium can be given. Finally, Section 4 gives some concluding remarks.

2 Maximum Principle and Verification Theorem for Stackelberg Equilibrium

2.1 The Follower's Problem

In this subsection, we first solve the partial information stochastic optimal control problem of the follower, that is, Problem of the follower.

For any chosen $u_2(\cdot) \in \mathcal{U}_2$, suppose that there exists an optimal control $u_1^*(\cdot)$ for the follower, and the corresponding optimal state $x^{u_1^*,u_2}(\cdot)$ is the solution to (1).

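Let an $\mathcal{F}_t$-adapted process triple $(q(\cdot), k(\cdot), \tilde{k}(\cdot)) \in \mathbb{R}^n \times \mathbb{R}^{n \times d_1} \times \mathbb{R}^{n \times d_2}$ uniquely solve the adjoint BSDE of the follower

$$
\begin{cases}
-dq(t) = \Big\{ b_x^\top(t, x^{u_1^*,u_2}(t), u_1^*(t), u_2(t))\,q(t) + \sum_{j=1}^{d_1} \sigma_x^{j\top}(t, x^{u_1^*,u_2}(t), u_1^*(t), u_2(t))\,k^j(t)\\
\qquad\qquad + \sum_{j=1}^{d_2} \tilde{\sigma}_x^{j\top}(t, x^{u_1^*,u_2}(t), u_1^*(t), u_2(t))\,\tilde{k}^j(t) - g_{1x}(t, x^{u_1^*,u_2}(t), u_1^*(t), u_2(t)) \Big\}\,dt\\
\qquad\qquad - k(t)\,dW(t) - \tilde{k}(t)\,d\widetilde{W}(t),\\
q(T) = -G_{1x}(x^{u_1^*,u_2}(T)),
\end{cases} \tag{8}
$$

where we denote $\sigma_x = (\sigma_x^1, \cdots, \sigma_x^{d_1})$, $\tilde{\sigma}_x = (\tilde{\sigma}_x^1, \cdots, \tilde{\sigma}_x^{d_2})$, and similar notations will be used.

Define the Hamiltonian function of the follower $H_1 : \Omega \times [0,T] \times \mathbb{R}^n \times U_1 \times U_2 \times \mathbb{R}^n \times \mathbb{R}^{n \times d_1} \times \mathbb{R}^{n \times d_2} \to \mathbb{R}$ as

$$
H_1(t, x, u_1, u_2; q, k, \tilde{k}) = \langle q, b(t,x,u_1,u_2) \rangle + \mathrm{tr}\{k^\top \sigma(t,x,u_1,u_2)\} + \mathrm{tr}\{\tilde{k}^\top \tilde{\sigma}(t,x,u_1,u_2)\} - g_1(t,x,u_1,u_2). \tag{9}
$$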


Similar to [21], we have the following results. 

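Proposition 2.1 (Partial Information Maximum Principle) Suppose that (A1.1) holds. For any given $u_2(\cdot) \in \mathcal{U}_2$, let $u_1^*(\cdot)$ be the optimal control for Problem of the follower, and $x^{u_1^*,u_2}(\cdot)$ be the corresponding optimal state. Let $(q(\cdot), k(\cdot), \tilde{k}(\cdot))$ be the adjoint process triple. Then we have

$$
\mathbb{E}\Big[\big\langle H_{1u_1}(t, x^{u_1^*,u_2}(t), u_1^*(t), u_2(t); q(t), k(t), \tilde{k}(t)),\, u_1 - u_1^*(t) \big\rangle \,\Big|\, \mathcal{G}_{1,t}\Big] \ge 0, \quad \text{a.e. } t \in [0,T], \text{ a.s.}, \tag{10}
$$

holds, for any $u_1 \in U_1$.

Proposition 2.2 (Partial Information Verification Theorem) Suppose that (A1.1) holds. For any given $u_2(\cdot)$, let $u_1^*(\cdot) \in \mathcal{U}_1$ and $x^{u_1^*,u_2}(\cdot)$ be the corresponding state. Let $(q(\cdot), k(\cdot), \tilde{k}(\cdot))$ be the adjoint process triple. Moreover, for each $(t, \omega) \in [0,T] \times \Omega$, $H_1(t, \cdot, \cdot, u_2(t); q(t), k(t), \tilde{k}(t))$ is concave, $G_1(\cdot)$ is convex, and

$$
\begin{aligned}
&\mathbb{E}\big[H_1(t, x^{u_1^*,u_2}(t), u_1^*(t), u_2(t); q(t), k(t), \tilde{k}(t)) \,\big|\, \mathcal{G}_{1,t}\big]\\
&\quad = \max_{u_1 \in U_1} \mathbb{E}\big[H_1(t, x^{u_1^*,u_2}(t), u_1, u_2(t); q(t), k(t), \tilde{k}(t)) \,\big|\, \mathcal{G}_{1,t}\big],
\end{aligned} \tag{11}
$$

holds for a.e. $t \in [0,T]$, a.s. Then $u_1^*(\cdot)$ is an optimal control for Problem of the follower.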

2.2 The Leader’s Problem 

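In this subsection, we first state the partial information stochastic optimal control problem of the leader in detail, that is, Problem of the leader. Then we give the maximum principle and verification theorem for it.

For any $u_2(\cdot) \in \mathcal{U}_2$, by the maximum condition (10), we assume that a functional $u_1^*(t) = u_1^*(t; \hat{x}^{u_1^*,u_2}(t), \hat{u}_2(t), \hat{q}(t), \hat{k}(t), \hat{\tilde{k}}(t))$ is uniquely defined, where

$$
\begin{cases}
\hat{x}^{u_1^*,u_2}(t) := \mathbb{E}[x^{u_1^*,u_2}(t) \,|\, \mathcal{G}_{1,t}],\\
\hat{u}_2(t) := \mathbb{E}[u_2(t) \,|\, \mathcal{G}_{1,t}], \quad \hat{q}(t) := \mathbb{E}[q(t) \,|\, \mathcal{G}_{1,t}],\\
\hat{k}(t) := \mathbb{E}[k(t) \,|\, \mathcal{G}_{1,t}], \quad \hat{\tilde{k}}(t) := \mathbb{E}[\tilde{k}(t) \,|\, \mathcal{G}_{1,t}].
\end{cases} \tag{12}
$$

For simplicity of notation, we denote $x^{u_2}(\cdot) = x^{u_1^*,u_2}(\cdot)$ and define $\phi^L$ on $\Omega \times [0,T] \times \mathbb{R}^n \times U_2$ as

$$
\phi^L(t, x^{u_2}(t), u_2(t)) := \phi\big(t, x^{u_1^*,u_2}(t), u_1^*(t; \hat{x}^{u_1^*,u_2}(t), \hat{u}_2(t), \hat{q}(t), \hat{k}(t), \hat{\tilde{k}}(t)), u_2(t)\big),
$$

for $\phi = b, \sigma, \tilde{\sigma}, g_1$, respectively. Then after substituting the above control process $u_1^*(\cdot)$ into the follower's adjoint equation (8), the leader encounters the controlled FBSDE system

$$
\begin{cases}
dx^{u_2}(t) = b^L(t, x^{u_2}(t), u_2(t))\,dt + \sigma^L(t, x^{u_2}(t), u_2(t))\,dW(t) + \tilde{\sigma}^L(t, x^{u_2}(t), u_2(t))\,d\widetilde{W}(t),\\
-dq(t) = \Big\{ b_x^{L\top}(t, x^{u_2}(t), u_2(t))\,q(t) + \sum_{j=1}^{d_1} \sigma_x^{Lj\top}(t, x^{u_2}(t), u_2(t))\,k^j(t)\\
\qquad\qquad + \sum_{j=1}^{d_2} \tilde{\sigma}_x^{Lj\top}(t, x^{u_2}(t), u_2(t))\,\tilde{k}^j(t) - g_{1x}^L(t, x^{u_2}(t), u_2(t)) \Big\}\,dt\\
\qquad\qquad - k(t)\,dW(t) - \tilde{k}(t)\,d\widetilde{W}(t),\\
x^{u_2}(0) = x_0, \quad q(T) = -G_{1x}(x^{u_2}(T)).
\end{cases} \tag{13}
$$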

Note that (13) is a controlled conditional mean-field FBSDE, which is now regarded as the "state" equation of the leader. That is to say, the state for the leader is the quadruple $(x^{u_2}(\cdot), q(\cdot), k(\cdot), \tilde{k}(\cdot))$.

Remark2.2 The equality (t) = u* (t; (t), ■U 2 (t), 

q{t),k{t),k{t)) does not hold in general. However, for 
LQ case, it is satisfied and we will make this point clear 
in the next section. 


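Define

$$
\begin{aligned}
J_2(u_2(\cdot)) := J_2(u_1^*(\cdot), u_2(\cdot)) &= \mathbb{E}\Big[\int_0^T g_2\big(t, x^{u_1^*,u_2}(t), u_1^*(t; \hat{x}^{u_1^*,u_2}(t), \hat{u}_2(t), \hat{q}(t), \hat{k}(t), \hat{\tilde{k}}(t)), u_2(t)\big)\,dt + G_2(x^{u_1^*,u_2}(T))\Big]\\
&:= \mathbb{E}\Big[\int_0^T g_2^L(t, x^{u_2}(t), u_2(t))\,dt + G_2(x^{u_2}(T))\Big],
\end{aligned} \tag{14}
$$

where $g_2^L : \Omega \times [0,T] \times \mathbb{R}^n \times U_2 \to \mathbb{R}$. Note that the cost functional of the leader is also of conditional mean-field type. We propose the stochastic optimal control problem with partial information of the leader as follows.

Problem of the leader. Find a $\mathcal{G}_{2,t}$-adapted control $u_2^*(\cdot) \in \mathcal{U}_2$, such that

$$
J_2(u_2^*(\cdot)) = \inf_{u_2 \in \mathcal{U}_2} J_2(u_2(\cdot)), \tag{15}
$$

subject to (13) and (14). Such a $u_2^*(\cdot)$ is called a (partial information) optimal control, and the corresponding solution $x^*(\cdot) = x^{u_2^*}(\cdot)$ to (13) is called a (partial information) optimal state process for the leader.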

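Suppose that there exists an optimal control $u_2^*(\cdot)$ for the leader, and the corresponding state $(x^*(\cdot), q^*(\cdot), k^*(\cdot), \tilde{k}^*(\cdot))$ is the solution to (13). Define the Hamiltonian function of the leader $H_2 : \Omega \times [0,T] \times \mathbb{R}^n \times U_2 \times \mathbb{R}^n \times \mathbb{R}^{n \times d_1} \times \mathbb{R}^{n \times d_2} \times \mathbb{R}^n \times \mathbb{R}^{n \times d_1} \times \mathbb{R}^{n \times d_2} \times \mathbb{R}^n \to \mathbb{R}$ as

$$
\begin{aligned}
H_2(t, x^{u_2}, u_2, q, k, \tilde{k}; y, z, \tilde{z}, p) &= \langle y, b^L(t, x^{u_2}, u_2) \rangle + \mathrm{tr}\{z^\top \sigma^L(t, x^{u_2}, u_2)\} + \mathrm{tr}\{\tilde{z}^\top \tilde{\sigma}^L(t, x^{u_2}, u_2)\} + g_2^L(t, x^{u_2}, u_2)\\
&\quad - \Big\langle p,\, b_x^{L\top}(t, x^{u_2}, u_2)\,q + \sum_{j=1}^{d_1} \sigma_x^{Lj\top}(t, x^{u_2}, u_2)\,k^j + \sum_{j=1}^{d_2} \tilde{\sigma}_x^{Lj\top}(t, x^{u_2}, u_2)\,\tilde{k}^j - g_{1x}^L(t, x^{u_2}, u_2) \Big\rangle.
\end{aligned} \tag{16}
$$

Let $(y(\cdot), z(\cdot), \tilde{z}(\cdot), p(\cdot)) \in \mathbb{R}^n \times \mathbb{R}^{n \times d_1} \times \mathbb{R}^{n \times d_2} \times \mathbb{R}^n$ be the unique $\mathcal{F}_t$-adapted solution to the adjoint conditional mean-field FBSDE of the leader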


[ dp{t) = {6^*(t)p(t) -bE[6t*(t)p(t)|^i,(]}dt 

d\ 

Y {^x*^d)pit) 

1=1 

+ E[dYmt)\Gi,t]}dW\t), 
-dy{t) = i^b^*{t)y{t) +E[bk*{t)y{t)\gi^t] 

+ Y [^x*^d)z^t)+E[a^*^{t)z^{t)\gi^t] 
1=1 
d2 

+ Y ^x*^dwd)+E[d’^*\t)z^{t)\gi,t] 

-Y{%d)qddPd^ 


1=1 

d2 

Y 

1=1 


-E 


dbi* 


it)q* iOptit) 


Gi, 


dx 

n f) / 

Y{-^iY^^*'d)k^d) ]p^d) 

d 


} 


+[ 4 “ (£ I ^'’‘ 1 } 

Tl pi / \ 

i—l ^ j—X ' 

+[^ (E dw d)) p. d) I ^^1.*]} 

+ 9iPd)pd) +E[giP{t)p(t)\gi^t] 

+ g^:{t)+E[g^iit)\g^Adt 


(17) 


— z{t)dW{t) — z(t)dW{t), 

p(0) = 0, 

I y{T) = GiUx*{T))p{T) + G 2 Px*{T)), 


where we have used = (j)^(t,x*{t),x*{t),U 2 (t), 

U 2 {t)) for (j) = b,(T, a, gi,g2 and all their derivatives. 


Now, we have the following two results. 
Proposition 2.3 (Maximum principle of conditional mean-field FBSDE with partial information) Suppose that (A1.1) holds. Let $u_2^*(\cdot) \in \mathcal{U}_2$ be an optimal control for Problem of the leader and $(x^*(\cdot), q^*(\cdot), k^*(\cdot), \tilde{k}^*(\cdot))$ be the optimal state. Let $(y(\cdot), z(\cdot), \tilde{z}(\cdot), p(\cdot))$ be the adjoint quadruple, then

E 


) a;* (t), u*2 (t), q* (t), k* (t), k* (t ); 
y{t) ,z{t), Z{t) , p{f)) ,U2-U2 {t) ^ 

+ (E[iJ2«2 x*{t),U2{t),q*{t), k*{t),k*{t);y{t), 


z{t), z{f),p{f))\gi^t \, U2 - ul{t) 

> 0, a.e.t € [0, T], a.s., 
holds for any U 2 &U 2 . 


f?2,i 


(18) 


Proof. The maximum condition (18) can be derived by convex variation and the adjoint technique, as in Andersson and Djehiche [2]. We omit the details to save space. See also Li [13], Yong [27] and the references therein for mean-field stochastic optimal control problems. □


Proposition 2.4 (Verification theorem of conditional mean-field FBSDE with partial information) Suppose that (A1.1) holds. Let $u_2^*(\cdot) \in \mathcal{U}_2$ and $(x^*(\cdot), q^*(\cdot), k^*(\cdot), \tilde{k}^*(\cdot))$ be the corresponding state, with $G_{1xx}(x) = G_1 \in \mathcal{S}^n$. Let $(y(\cdot), z(\cdot), \tilde{z}(\cdot), p(\cdot))$ be the adjoint quadruple. For each $(t, \omega) \in [0,T] \times \Omega$, suppose that $H_2(t, \cdot, \cdot, \cdot, \cdot, \cdot; y(t), z(t), \tilde{z}(t), p(t))$ and $G_2(\cdot)$ are convex,

and 


E 


H 2 {t, x*{t), u* 2 {t),q*{f), k*{t), k*{t); 

y{t),z{t),z{t),p{t)) 

-f E [H 2 {t, x* (t ), U 2 (t ), q* (t), k* (t), k* (t) ;y{t), 


z{t),z{t),pit))\gi,t 


f/2,t 


= max E 

U 2 GU 2 


H 2 {t,x*(t), U 2 , q*(t),k*(t),k*it); 
y{t),z{t),^t),p{t)) 

-\-E[H 2 {t,x*(t),U 2 ,q*{t),k*(t),k*{t);y(t), 


z{t),z{t),p{t))\gi,t 


f?2,t 


(19) 

holds for a.e.t G [0,T],a.s. Then u^i-) is an optimal 
control for Problem of the leader. 


Proof. The proof is similar to that in Shi [16]. We omit the details for simplicity. □


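Remark 2.3 In some applications, for example, the principal-agent problems, the opposite inclusion $\mathcal{G}_{1,t} \supseteq \mathcal{G}_{2,t}$ or more complicated relations between them may hold. The study of the leader-follower game in such cases is similar to the one studied in this paper. We omit them to limit the length of the paper.

3 LQ Leader-Follower Stochastic Differential Game with Asymmetric Information

In order to illustrate the theoretical results in Section 2, we study an LQ leader-follower stochastic differential game with asymmetric information. For notational simplicity, we consider $d_1 = d_2 = 1$. This game is a partially observed model which is a special case of the one in Section 2, but the resulting deduction is technically demanding. We split this section into two subsections, to deal with the problems of the follower and the leader, respectively.

3.1 The Follower's LQ Problem

Suppose that the state $(x^{u_1,u_2}(\cdot), \tilde{x}(\cdot)) \in \mathbb{R}^n \times \mathbb{R}^n$ of the system is described by the linear SDE

$$
\begin{cases}
dx^{u_1,u_2}(t) = \big[A(t)x^{u_1,u_2}(t) + B_1(t)u_1(t) + B_2(t)u_2(t)\big]\,dt\\
\qquad\qquad\quad + \big[C(t)x^{u_1,u_2}(t) + D_1(t)u_1(t) + D_2(t)u_2(t)\big]\,dW(t) + \tilde{C}(t)\,d\widetilde{W}(t),\\
d\tilde{x}(t) = C_1(t)\tilde{x}(t)\,dt + C_2(t)\,dW(t) + C_3(t)\,d\widetilde{W}(t),\\
x^{u_1,u_2}(0) = x_0, \quad \tilde{x}(0) = \tilde{x}_0,
\end{cases} \tag{20}
$$

where $u_1(\cdot)$ takes values in $\mathbb{R}^{m_1}$ and $u_2(\cdot)$ takes values in $\mathbb{R}^{m_2}$. Note that the second part of the state, $\tilde{x}(\cdot)$, is not controlled. Here $A(\cdot), C(\cdot), C_1(\cdot) \in \mathcal{S}^n$, $\tilde{C}(\cdot), C_2(\cdot), C_3(\cdot) \in \mathbb{R}^n$, $B_1(\cdot), D_1(\cdot) \in \mathbb{R}^{n \times m_1}$ and $B_2(\cdot), D_2(\cdot) \in \mathbb{R}^{n \times m_2}$ are deterministic, bounded matrix-valued or vector-valued functions, and $x_0, \tilde{x}_0 \in \mathbb{R}^n$.

Suppose that the state $(x^{u_1,u_2}(\cdot), \tilde{x}(\cdot))$ cannot be directly observed by the follower. Instead, he can observe a related process $Y(\cdot) \in \mathbb{R}$ which is governed by the SDE

$$
\begin{cases}
dY(t) = h^\top(t)\tilde{x}(t)\,dt + d\widetilde{W}(t),\\
Y(0) = 0,
\end{cases} \tag{21}
$$

where $h : [0,T] \to \mathbb{R}^n$ is a given continuous, bounded vector-valued function.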


minimize his cost functional 

{Ni{t)ui{t),ui{t))^dt 


= -E 
2 


(25) 


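Inserting (21) into (20), we have

$$
\begin{cases}
dx^{u_1,u_2}(t) = \big[A(t)x^{u_1,u_2}(t) - \tilde{C}(t)h^\top(t)\tilde{x}(t) + B_1(t)u_1(t) + B_2(t)u_2(t)\big]\,dt\\
\qquad\qquad\quad + \big[C(t)x^{u_1,u_2}(t) + D_1(t)u_1(t) + D_2(t)u_2(t)\big]\,dW(t) + \tilde{C}(t)\,dY(t),\\
d\tilde{x}(t) = \big[C_1(t) - C_3(t)h^\top(t)\big]\tilde{x}(t)\,dt + C_2(t)\,dW(t) + C_3(t)\,dY(t),\\
x^{u_1,u_2}(0) = x_0, \quad \tilde{x}(0) = \tilde{x}_0.
\end{cases} \tag{22}
$$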


Let us now introduce an R-valued process 


Here (5i(-) G 5",iVi(-) G are deterministic and 
bounded matrix-valued functions, and Gi G 5”. We in¬ 
troduce the following assumption. 

(A3.1) gi(t) > 0,Vt G [0,T] andGi > 0. 

The admissible control set Ui for the follower is defined 
as (2) in Section 1, for Qi^t = this section. And we 

will denote the above as LQ Problem of the follower. 

We apply the maximum principle approach in Subsec¬ 
tion 2.1 to solve this partially observed stochastic opti¬ 
mal control problem of the follower. In the following, we 
will split the process of finding observable optimal con¬ 
trol Mi(-) for the follower into three steps. 


Z{t) = exp 


— / {s)x{s)dW{s) 


1 


|/i^(s)x(s)pds 


Step 1. (Optimal control) 

(23) First, we put the above problem into the filtered prob¬ 
ability space (O, {Jy^’^}o<t<rj ]P)j in order to apply 

Propositions 2.1 and 2.2. By (21) and (24), we obtain 


which is the solution to the SDE 

( dZ{t) = —Z{t)h^{t)x{t)dW{t), 
I Z(0) = 1. 


r dZ{t) = Z{t)\h^{t)x{t)\'^dt — Z{t)h^{t)x{t)dY{t), 

I Z{0) = 1. 

(24) (26) 

Moreover, we have 


It is obvious that Z{-) is a ((J^t)o<t<T, ]P)-niartingale, 
since the Novikov condition holds: 


E 



< 00 


(see Example 3, Section 6.2, Liptser and Shiryayev [14]). 
We thus can define a new probability measure P such 
that for any t,dV = Z{t)dF, on Tt- By Girsanov’s the¬ 
orem and (21), (IE(-),y(-)) is a 2-dimensional stan¬ 
dard Brownian motion defined on (12, (J^t)o<t<T 7 P)- 

In this section, we denote Tt = 

j-w.y ^ j-w ^ j:Y^ where are the natu¬ 

ral filtration generated by Brownian motions W {■),]¥(•) 
and E)-)? respectively. Moreover, it can be easily checked 
that Bt = . 


For chosen U 2 {-) by the leader, the follower would like 
to choose an Jy^-adapted control ul{-) = ul{-;u 2 {-)) to 


dZ-\t) = Z-\t)h^ {t)^t)dY{t), 

z-i(o) = i. 


that is. 


Z ^(t) = exp|y {s)x{s)dY(s) 


|ft-^(s)a;(s)p(is 


(28) 


By Bayes’ formula (Theorem 3.22, Xiong [24]), we have 
Ji{ui{-),U2{-)) 


= 1^ 


L.^0 


Z-i(t)Qi(t)x“^’“2(t),x“^’“2(t)) 

+ {Z~^{t)Ni{t)ui{t),ui{t))'^dt ^ ^ 

+ {Z-YT)Gix^^’^^T),x^^’^^T)) , 
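where $\bar{\mathbb E}$ is the mathematical expectation with respect to $\bar{\mathbb P}$. The Hamiltonian function (9) of the follower now takes the form

$$
\begin{aligned}
H_1(t,x,\bar x,u_1,u_2;q,k,\bar k;Z^{-1}) &= \langle q, A(t)x - h^\top(t)\bar C(t)\bar x + B_1(t)u_1 + B_2(t)u_2\rangle\\
&\quad + \langle k, C(t)x + D_1(t)u_1 + D_2(t)u_2\rangle + \langle\bar k,\bar C(t)\rangle\\
&\quad - \frac12\langle Z^{-1}Q_1(t)x,x\rangle - \frac12\langle Z^{-1}N_1(t)u_1,u_1\rangle,
\end{aligned}\tag{30}
$$

and the adjoint equation (8) writes

$$
\left\{\begin{aligned}
-dq(t) &= [A^\top(t)q(t) + C^\top(t)k(t) - Z^{-1}(t)Q_1(t)x^{u_1^*,u_2}(t)]dt - k(t)dW(t) - \bar k(t)dY(t),\\
q(T) &= -Z^{-1}(T)G_1x^{u_1^*,u_2}(T),
\end{aligned}\right.\tag{31}
$$

with $(q(\cdot),k(\cdot),\bar k(\cdot))$ valued in $\mathbb R^n\times\mathbb R^n\times\mathbb R^n$ being $\mathcal F_t$-adapted processes.

For given $u_2(\cdot)$, suppose that there exists an $\mathcal F_t^Y$-adapted optimal control $u_1^*(\cdot)\in\mathcal U_1$ for the follower, and the corresponding optimal state $x^{u_1^*,u_2}(\cdot)$ is the solution to (22), that is,

$$
\left\{\begin{aligned}
dx^{u_1^*,u_2}(t) &= [A(t)x^{u_1^*,u_2}(t) - h^\top(t)\bar C(t)\bar x(t) + B_1(t)u_1^*(t) + B_2(t)u_2(t)]dt\\
&\quad + [C(t)x^{u_1^*,u_2}(t) + D_1(t)u_1^*(t) + D_2(t)u_2(t)]dW(t) + \bar C(t)dY(t),\\
x^{u_1^*,u_2}(0) &= x_0,
\end{aligned}\right.\tag{32}
$$

where $\bar x(\cdot)\in\mathbb R^n$ can be solved from the second equation of (22), since it is a process which is not controlled.

Then (30) together with Proposition 2.1 yields that

$$
0 = \widehat{Z^{-1}}(t;\bar{\mathbb P})N_1(t)u_1^*(t) - B_1^\top(t)\hat q(t;\bar{\mathbb P}) - D_1^\top(t)\hat k(t;\bar{\mathbb P}),\quad\text{for a.e. } t\in[0,T],\tag{33}
$$

with

$$
\widehat{Z^{-1}}(t;\bar{\mathbb P}) := \bar{\mathbb E}[Z^{-1}(t)|\mathcal F_t^Y],\quad
\hat q(t;\bar{\mathbb P}) := \bar{\mathbb E}[q(t)|\mathcal F_t^Y],\quad
\hat k(t;\bar{\mathbb P}) := \bar{\mathbb E}[k(t)|\mathcal F_t^Y].
$$

Step 2. (Optimal filtering)

Next, noting the terminal condition of (31), it is natural to set

$$
q(t) = -Z^{-1}(t)[P_1(t)x^{u_1^*,u_2}(t) + \varphi(t)],\quad t\in[0,T],\tag{34}
$$

for some deterministic and differentiable $\mathcal S^n$-valued function $P_1(\cdot)$, where the $\mathbb R^n$-valued, $\mathcal F_t^{W,Y}$-adapted process $\varphi(\cdot)$ admits

$$
\left\{\begin{aligned}
d\varphi(t) &= \alpha(t)dt + \beta(t)dY(t),\\
\varphi(T) &= 0.
\end{aligned}\right.\tag{35}
$$

In the above, $\alpha(\cdot)\in\mathbb R^n$ and $\beta(\cdot)\in\mathbb R^n$ are $\mathcal F_t^{W,Y}$-adapted processes, which will be determined later.

Now, applying Itô's formula to (34), we have

$$
\begin{aligned}
dq(t) &= -Z^{-1}(t)[P_1(t)dx^{u_1^*,u_2}(t) + \dot P_1(t)x^{u_1^*,u_2}(t)dt + d\varphi(t)]\\
&\quad - dZ^{-1}(t)[P_1(t)x^{u_1^*,u_2}(t) + \varphi(t)] - dZ^{-1}(t)[P_1(t)dx^{u_1^*,u_2}(t) + d\varphi(t)]\\
&= -Z^{-1}(t)[\dot P_1(t)x^{u_1^*,u_2}(t) + P_1(t)A(t)x^{u_1^*,u_2}(t) + P_1(t)B_1(t)u_1^*(t) + P_1(t)B_2(t)u_2(t)\\
&\qquad\qquad + h^\top(t)\bar x(t)\beta(t) + \alpha(t)]dt\\
&\quad - Z^{-1}(t)P_1(t)[C(t)x^{u_1^*,u_2}(t) + D_1(t)u_1^*(t) + D_2(t)u_2(t)]dW(t)\\
&\quad - Z^{-1}(t)[P_1(t)\bar C(t) + P_1(t)h^\top(t)\bar x(t)x^{u_1^*,u_2}(t) + h^\top(t)\bar x(t)\varphi(t) + \beta(t)]dY(t).
\end{aligned}\tag{36}
$$

Comparing the $\{\cdot\}dW(t)$, $\{\cdot\}dY(t)$ and $\{\cdot\}dt$ terms of (36) with those of the adjoint equation (31), we arrive at

$$
k(t) = -Z^{-1}(t)P_1(t)[C(t)x^{u_1^*,u_2}(t) + D_1(t)u_1^*(t) + D_2(t)u_2(t)],\tag{37}
$$

$$
\bar k(t) = -Z^{-1}(t)[P_1(t)\bar C(t) + P_1(t)h^\top(t)\bar x(t)x^{u_1^*,u_2}(t) + h^\top(t)\bar x(t)\varphi(t) + \beta(t)],\tag{38}
$$

and

$$
\begin{aligned}
\alpha(t) &= [-\dot P_1(t) - P_1(t)A(t) - A^\top(t)P_1(t) - C^\top(t)P_1(t)C(t) - Q_1(t)]x^{u_1^*,u_2}(t)\\
&\quad - A^\top(t)\varphi(t) - h^\top(t)\bar x(t)\beta(t)\\
&\quad - [P_1(t)B_1(t) + C^\top(t)P_1(t)D_1(t)]u_1^*(t)\\
&\quad - [P_1(t)B_2(t) + C^\top(t)P_1(t)D_2(t)]u_2(t),
\end{aligned}\tag{39}
$$

respectively. By Bayes' formula, noting (34), we have

$$
\hat q(t;\bar{\mathbb P}) := \bar{\mathbb E}[q(t)|\mathcal F_t^Y] = \frac{\mathbb E[Z(t)q(t)|\mathcal F_t^Y]}{\mathbb E[Z(t)|\mathcal F_t^Y]} = \frac{-P_1(t)\hat x^{u_1^*,u_2}(t) - \hat\varphi(t)}{\mathbb E[Z(t)|\mathcal F_t^Y]},\tag{40}
$$

where $\hat x^{u_1^*,u_2}(t) := \mathbb E[x^{u_1^*,u_2}(t)|\mathcal F_t^Y]$, $\hat\varphi(t) := \mathbb E[\varphi(t)|\mathcal F_t^Y]$.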
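The measure change behind (29) and (40) can be sanity-checked numerically. The following is a minimal Monte Carlo sketch, assuming a scalar toy model ($n=1$, constant $h$, constant coefficients `c1`, `c3` for $\bar x$ and $\bar C_2=0$); the function name and all parameter values are illustrative assumptions and do not come from the paper. It simulates $\log Z$ under $\mathbb P$ and estimates $\mathbb E[Z(T)]$, $\mathbb E[Z(T)Y(T)]$ and $\mathbb E[Z(T)Y^2(T)]$, which should be close to $1$, $0$ and $T$ if $Y(\cdot)$ is a Brownian motion under $\bar{\mathbb P}$.

```python
import math
import random


def simulate_change_of_measure(n_paths=20000, n_steps=50, T=1.0, h=0.5,
                               c1=-1.0, c3=0.3, x0=1.0, seed=0):
    """Monte Carlo check of d(P_bar) = Z dP for a scalar toy model.

    Under P (all coefficients are illustrative assumptions):
        d x_bar = c1 * x_bar dt + c3 dW_bar,
        d Y     = h * x_bar dt + dW_bar,
        log Z   = -int h*x_bar dW_bar - 0.5 * int (h*x_bar)^2 ds.
    Returns estimates of E[Z(T)], E[Z(T) Y(T)] and E[Z(T) Y(T)^2].
    """
    rng = random.Random(seed)
    dt = T / n_steps
    sqrt_dt = math.sqrt(dt)
    sum_z = sum_zy = sum_zy2 = 0.0
    for _ in range(n_paths):
        x_bar, y, log_z = x0, 0.0, 0.0
        for _ in range(n_steps):
            dw_bar = rng.gauss(0.0, sqrt_dt)
            a = h * x_bar                      # observation drift h^T x_bar
            log_z += -a * dw_bar - 0.5 * a * a * dt
            y += a * dt + dw_bar               # observation equation (21)
            x_bar += c1 * x_bar * dt + c3 * dw_bar
        z = math.exp(log_z)
        sum_z += z
        sum_zy += z * y
        sum_zy2 += z * y * y
    return sum_z / n_paths, sum_zy / n_paths, sum_zy2 / n_paths


if __name__ == "__main__":
    mean_z, mean_zy, mean_zy2 = simulate_change_of_measure()
    print("E[Z(T)]        ~", mean_z)
    print("E[Z(T) Y(T)]   ~", mean_zy)
    print("E[Z(T) Y(T)^2] ~", mean_zy2)
```

Working with $\log Z$ rather than an Euler step for (24) keeps $Z$ positive and makes each increment of $Y$ exactly Gaussian with mean zero under the tilted measure, so the only error in the three estimates is Monte Carlo noise.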
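Similarly, by (37) we get

$$
\hat k(t;\bar{\mathbb P}) := \bar{\mathbb E}[k(t)|\mathcal F_t^Y] = \frac{\mathbb E[Z(t)k(t)|\mathcal F_t^Y]}{\mathbb E[Z(t)|\mathcal F_t^Y]}
= \frac{-P_1(t)C(t)\hat x^{u_1^*,u_2}(t) - P_1(t)D_1(t)u_1^*(t) - P_1(t)D_2(t)\hat u_2(t)}{\mathbb E[Z(t)|\mathcal F_t^Y]},\tag{41}
$$

where $\hat u_2(t) := \mathbb E[u_2(t)|\mathcal F_t^Y]$.

Applying Lemma 5.4 in Xiong [24] to the state equation (32) of the follower, we derive the filtering equation

$$
\left\{\begin{aligned}
d\hat x^{u_1^*,u_2}(t) &= [A(t)\hat x^{u_1^*,u_2}(t) - h^\top(t)\bar C(t)\hat{\bar x}(t) + B_1(t)u_1^*(t) + B_2(t)\hat u_2(t)]dt + \bar C(t)dY(t),\\
\hat x^{u_1^*,u_2}(0) &= x_0,
\end{aligned}\right.\tag{42}
$$

where $\hat{\bar x}(t) := \mathbb E[\bar x(t)|\mathcal F_t^Y]$ satisfies

$$
\left\{\begin{aligned}
d\hat{\bar x}(t) &= [\bar C_1(t) - h^\top(t)\bar C_3(t)]\hat{\bar x}(t)dt + \bar C_3(t)dY(t),\\
\hat{\bar x}(0) &= \bar x_0,
\end{aligned}\right.
$$

which can be easily solved. Thus (42) admits a unique $\mathcal F_t^Y$-adapted solution $\hat x^{u_1^*,u_2}(\cdot)$, for any $\hat u_2(\cdot)$.

Step 3. (Optimal state feedback control)

By (33), (40) and (41), in order to obtain the state feedback form of the follower's optimal control, the remaining task is to give the representation of $\hat Z(t) := \mathbb E[Z(t)|\mathcal F_t^Y]$, since by Bayes' formula we have

$$
\widehat{Z^{-1}}(t;\bar{\mathbb P}) := \bar{\mathbb E}[Z^{-1}(t)|\mathcal F_t^Y] = \frac{\mathbb E[Z(t)Z^{-1}(t)|\mathcal F_t^Y]}{\mathbb E[Z(t)|\mathcal F_t^Y]} = \frac{1}{\hat Z(t)} = \hat Z^{-1}(t).
$$

Applying Theorem 8.1 of [14] to (24) and (21), we have

$$
\left\{\begin{aligned}
d\hat Z(t) &= -h^\top(t)\hat{\bar x}(t)\hat Z(t)d\tilde W(t),\\
\hat Z(0) &= 1,
\end{aligned}\right.
$$

where $d\tilde W(t) := dY(t) - h^\top(t)\hat{\bar x}(t)dt$.

By (33), and supposing that

(A3.2) $N_1(t) > 0$, for all $t\in[0,T]$,

we immediately arrive at

$$
\begin{aligned}
u_1^*(t) &= u_1^*(t;\hat x^{u_1^*,u_2}(t),\hat u_2(t),\hat\varphi(t),\hat\beta(t))\\
&= -\tilde N_1^{-1}(t)[\tilde S_1^\top(t)\hat x^{u_1^*,u_2}(t) + \tilde S(t)\hat u_2(t) + B_1^\top(t)\hat\varphi(t)],
\end{aligned}\tag{45}
$$

for a.e. $t\in[0,T]$, where we denote

$$
\left\{\begin{aligned}
\tilde N_1(t) &:= N_1(t) + D_1^\top(t)P_1(t)D_1(t),\\
\tilde S_1(t) &:= P_1(t)B_1(t) + C^\top(t)P_1(t)D_1(t),\\
\tilde S(t) &:= D_1^\top(t)P_1(t)D_2(t).
\end{aligned}\right.
$$

Substituting (45) into (39), we can obtain that if the ordinary differential equation

$$
\left\{\begin{aligned}
\dot P_1(t) &= -P_1(t)A(t) - A^\top(t)P_1(t) - C^\top(t)P_1(t)C(t) + \tilde S_1(t)\tilde N_1^{-1}(t)\tilde S_1^\top(t) - Q_1(t),\\
P_1(T) &= G_1
\end{aligned}\right.\tag{46}
$$

admits a unique differentiable solution $P_1(\cdot)$, then

$$
\begin{aligned}
\alpha(t) &= -\tilde S_1(t)\tilde N_1^{-1}(t)\tilde S_1^\top(t)x^{u_1^*,u_2}(t) + \tilde S_1(t)\tilde N_1^{-1}(t)\tilde S_1^\top(t)\hat x^{u_1^*,u_2}(t)\\
&\quad - A^\top(t)[\varphi(t) - \hat\varphi(t)] + [-A^\top(t) + \tilde S_1(t)\tilde N_1^{-1}(t)B_1^\top(t)]\hat\varphi(t)\\
&\quad - h^\top(t)\bar x(t)\beta(t) - \tilde S_2(t)[u_2(t) - \hat u_2(t)]\\
&\quad + [\tilde S_1(t)\tilde N_1^{-1}(t)\tilde S(t) - \tilde S_2(t)]\hat u_2(t),
\end{aligned}\tag{47}
$$

where

$$
\tilde S_2(t) := P_1(t)B_2(t) + C^\top(t)P_1(t)D_2(t).
$$

And the BSDE (35) takes the form

$$
\left\{\begin{aligned}
-d\varphi(t) &= [A^\top(t)\varphi(t) + (\tilde A(t) - A(t))^\top\hat\varphi(t) + h^\top(t)\bar x(t)\beta(t)\\
&\qquad + \bar S_1(t)x^{u_1^*,u_2}(t) - \bar S_1(t)\hat x^{u_1^*,u_2}(t) + \tilde S_2(t)u_2(t)\\
&\qquad + (\tilde S_3(t) - \tilde S_2(t))\hat u_2(t)]dt - \beta(t)dY(t),\\
\varphi(T) &= 0,
\end{aligned}\right.\tag{48}
$$

where we set

$$
\left\{\begin{aligned}
\tilde A(t) &:= A(t) - B_1(t)\tilde N_1^{-1}(t)\tilde S_1^\top(t),\\
\tilde B_2(t) &:= B_2(t) - B_1(t)\tilde N_1^{-1}(t)\tilde S(t),\\
\bar S_1(t) &:= \tilde S_1(t)\tilde N_1^{-1}(t)\tilde S_1^\top(t),\\
\tilde S_3(t) &:= -\tilde S_1(t)\tilde N_1^{-1}(t)\tilde S(t) + \tilde S_2(t).
\end{aligned}\right.
$$

We rewrite (46) as the following

$$
\left\{\begin{aligned}
\dot P_1(t) &= -P_1(t)A(t) - A^\top(t)P_1(t) - C^\top(t)P_1(t)C(t) - Q_1(t)\\
&\quad + [P_1(t)B_1(t) + C^\top(t)P_1(t)D_1(t)][N_1(t) + D_1^\top(t)P_1(t)D_1(t)]^{-1}[P_1(t)B_1(t) + C^\top(t)P_1(t)D_1(t)]^\top,\\
P_1(T) &= G_1,\\
N_1(t) &+ D_1^\top(t)P_1(t)D_1(t) > 0,\quad \forall t\in[0,T],
\end{aligned}\right.\tag{49}
$$

which is similar to the classical Riccati equation (6.6), Chapter 6 in Yong and Zhou [28], or (5.2) in [26]. Thus its solvability can be guaranteed.

For given $\hat u_2(\cdot)$, plugging the optimal control $u_1^*(\cdot)$ of (45) into (42), we have

$$
\left\{\begin{aligned}
d\hat x^{u_1^*,u_2}(t) &= [\tilde A(t)\hat x^{u_1^*,u_2}(t) + \tilde S_4(t)\hat\varphi(t) + \tilde B_2(t)\hat u_2(t) - h^\top(t)\bar C(t)\hat{\bar x}(t)]dt + \bar C(t)dY(t),\\
\hat x^{u_1^*,u_2}(0) &= x_0,
\end{aligned}\right.\tag{50}
$$

which admits a unique $\mathcal F_t^Y$-adapted solution $\hat x^{u_1^*,u_2}(\cdot)$, where we denote

$$
\tilde S_4(t) := -B_1(t)\tilde N_1^{-1}(t)B_1^\top(t).
$$

Applying Lemma 5.4 in Xiong [24] to (48), we have

$$
\left\{\begin{aligned}
-d\hat\varphi(t) &= [\tilde A^\top(t)\hat\varphi(t) + h^\top(t)\hat{\bar x}(t)\hat\beta(t) + \tilde S_3(t)\hat u_2(t)]dt - \hat\beta(t)dY(t),\\
\hat\varphi(T) &= 0.
\end{aligned}\right.\tag{51}
$$

Putting (50) and (51) together, we get

$$
\left\{\begin{aligned}
d\hat x^{u_1^*,u_2}(t) &= [\tilde A(t)\hat x^{u_1^*,u_2}(t) + \tilde S_4(t)\hat\varphi(t) + \tilde B_2(t)\hat u_2(t) - h^\top(t)\bar C(t)\hat{\bar x}(t)]dt + \bar C(t)dY(t),\\
-d\hat\varphi(t) &= [\tilde A^\top(t)\hat\varphi(t) + h^\top(t)\hat{\bar x}(t)\hat\beta(t) + \tilde S_3(t)\hat u_2(t)]dt - \hat\beta(t)dY(t),\\
\hat x^{u_1^*,u_2}(0) &= x_0,\quad \hat\varphi(T) = 0.
\end{aligned}\right.\tag{52}
$$

Note that (52) is an FBSDFE, whose solvability can be easily obtained, for given $\hat u_2(\cdot)$.

Moreover, we can easily check that the convexity/concavity conditions are satisfied. Then by Proposition 2.2, $u_1^*(\cdot)$ given by (45) is really optimal.

We summarize the above in the following theorem.

Theorem 3.1 Suppose that assumptions (A3.1), (A3.2) hold and the Riccati equation (49) admits a unique differentiable solution $P_1(\cdot)$. For chosen $u_2(\cdot)$ of the leader, LQ Problem of the follower admits an optimal control $u_1^*(\cdot) = u_1^*(\cdot;\hat x^{u_1^*,u_2}(\cdot),\hat u_2(\cdot),\hat\varphi(\cdot),\hat\beta(\cdot))$ of the state feedback form (45), where the process triple $(\hat x^{u_1^*,u_2}(\cdot),\hat\varphi(\cdot),\hat\beta(\cdot))$ is the unique $\mathcal F_t^Y$-adapted solution to the FBSDFE (52).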
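To make the Riccati equation (49) and the feedback law (45) concrete, the following is a minimal numerical sketch, assuming the scalar case $n=m_1=m_2=1$ with constant coefficients. The function names, the classical Runge-Kutta scheme and all numerical values are assumptions chosen for illustration; the paper itself does not prescribe a numerical method.

```python
def riccati_rhs(p, A, C, Q1, B1, D1, N1):
    """Scalar right-hand side dP1/dt of the Riccati equation (49)."""
    n_tilde = N1 + D1 * p * D1          # N1 + D1^T P1 D1, assumed positive
    s1_tilde = p * B1 + C * p * D1      # P1 B1 + C^T P1 D1
    return -2.0 * A * p - C * p * C - Q1 + s1_tilde * s1_tilde / n_tilde


def solve_riccati(A, C, Q1, B1, D1, N1, G1, T=1.0, n_steps=1000):
    """Integrate (49) backward from P1(T) = G1 with classical RK4.

    Returns a list `path` with path[k] approximating P1(k * T / n_steps).
    """
    dt = T / n_steps

    def f(p):
        return riccati_rhs(p, A, C, Q1, B1, D1, N1)

    p = G1
    path = [p]
    for _ in range(n_steps):
        # One RK4 step of size -dt: march from time t to time t - dt.
        k1 = f(p)
        k2 = f(p - 0.5 * dt * k1)
        k3 = f(p - 0.5 * dt * k2)
        k4 = f(p - dt * k3)
        p = p - dt * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
        path.append(p)
    path.reverse()  # path[0] is P1(0), path[-1] is P1(T)
    return path


def follower_feedback(p, x_hat, u2_hat, phi_hat, C, B1, D1, D2, N1):
    """Scalar version of the feedback law (45) for given filtered values."""
    n_tilde = N1 + D1 * p * D1
    s1_tilde = p * B1 + C * p * D1
    s_tilde = D1 * p * D2
    return -(s1_tilde * x_hat + s_tilde * u2_hat + B1 * phi_hat) / n_tilde


if __name__ == "__main__":
    path = solve_riccati(A=-0.5, C=0.2, Q1=1.0, B1=1.0, D1=0.3, N1=1.0, G1=1.0)
    u1 = follower_feedback(path[0], x_hat=1.0, u2_hat=0.0, phi_hat=0.0,
                           C=0.2, B1=1.0, D1=0.3, D2=0.1, N1=1.0)
    print("P1(0) ~", path[0], " u1*(0) ~", u1)
```

In the special case $A=C=Q_1=D_1=0$ the equation reduces to $\dot P_1 = P_1^2B_1^2/N_1$, whose solution $P_1(t) = G_1/(1+G_1B_1^2(T-t)/N_1)$ gives a convenient closed-form check of the integrator.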


Remark 3.1 Due to the explicit appearance of the control $u_2(\cdot)$ in (31), (32), we introduce the $\mathcal F_t^{W,Y}$-adapted process $\varphi(\cdot)$ in (34), which satisfies the BSDE (35). This is different from [28] but similar to [26]. More importantly, we will see in the next subsection that (48) can be regarded as the new backward “state” equation of the leader.


Remark 3.2 Note that (52) is an FBSDE system, so the value $(\hat x^{u_1^*,u_2}(t),\hat\varphi(t),\hat\beta(t))$ of $(\hat x^{u_1^*,u_2}(\cdot),\hat\varphi(\cdot),\hat\beta(\cdot))$ at time $t$ depends on $\{\hat u_2(s); s\in[0,T]\}$. Thus, by (45), $u_1^*(\cdot)$ depends on $\{\hat u_2(s); s\in[0,T]\}$. This means that $u_1^*(\cdot)$ is generally anticipating. Thus, it is important to find a “real” state feedback representation for $u_1^*(\cdot)$ in terms of the original state $\hat x^{u_1^*,u_2}(\cdot)$. This work will be done at the end of this section via the dimensional augmentation introduced by Yong [26], together with the optimal control for the leader.


3.2 The Leader’s LQ Problem 


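Now, we are ready to study the stochastic optimal control problem of the leader. Knowing that the follower would take his optimal control $u_1^*(\cdot) = u_1^*(\cdot;\hat x^{u_1^*,u_2}(\cdot),\hat u_2(\cdot),\hat\varphi(\cdot),\hat\beta(\cdot))$ by (45), his state equation (32) writes

$$
\left\{\begin{aligned}
dx^{u_2}(t) &= [A(t)x^{u_2}(t) + (\tilde A(t) - A(t))\hat x^{u_2}(t) + \tilde S_4(t)\hat\varphi(t) + B_2(t)u_2(t)\\
&\qquad + (\tilde B_2(t) - B_2(t))\hat u_2(t)]dt\\
&\quad + [C(t)x^{u_2}(t) + \tilde S_5(t)\hat x^{u_2}(t) + \tilde B_1^\top(t)\hat\varphi(t) + D_2(t)u_2(t)\\
&\qquad + \tilde S_6(t)\hat u_2(t)]dW(t) + \bar C(t)d\bar W(t),\\
x^{u_2}(0) &= x_0,
\end{aligned}\right.\tag{53}
$$

where we denote $x^{u_2}(\cdot) \equiv x^{u_1^*,u_2}(\cdot)$, $\hat x^{u_2}(\cdot) \equiv \hat x^{u_1^*,u_2}(\cdot)$ for simplicity and

$$
\left\{\begin{aligned}
\tilde B_1(t) &:= -B_1(t)\tilde N_1^{-1}(t)D_1^\top(t),\\
\tilde S_5(t) &:= -D_1(t)\tilde N_1^{-1}(t)\tilde S_1^\top(t),\\
\tilde S_6(t) &:= -D_1(t)\tilde N_1^{-1}(t)\tilde S(t).
\end{aligned}\right.
$$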
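The passage from (32) to (53) is a substitution of the feedback law (45) into the drift and diffusion coefficients. As a minimal sketch, assuming the scalar case with constant coefficients, the check below evaluates both forms at arbitrary test values and compares them; the function names and the numbers are illustrative assumptions only, and the terms involving $\bar C$ are left out because the substitution does not touch them.

```python
def closed_loop_coefficients(p, A, C, B1, B2, D1, D2, N1):
    """Scalar versions of the coefficients appearing in (53)."""
    n_tilde = N1 + D1 * p * D1            # N1 + D1^T P1 D1
    s1_tilde = p * B1 + C * p * D1        # P1 B1 + C^T P1 D1
    s_tilde = D1 * p * D2                 # D1^T P1 D2
    return {
        "n_tilde": n_tilde,
        "s1_tilde": s1_tilde,
        "s_tilde": s_tilde,
        "A_tilde": A - B1 * s1_tilde / n_tilde,
        "B2_tilde": B2 - B1 * s_tilde / n_tilde,
        "S4_tilde": -B1 * B1 / n_tilde,
        "S5_tilde": -D1 * s1_tilde / n_tilde,
        "S6_tilde": -D1 * s_tilde / n_tilde,
        "B1_tilde": -B1 * D1 / n_tilde,
    }


def coefficients_direct(x, x_hat, u2, u2_hat, phi_hat, p, A, C, B1, B2, D1, D2, N1):
    """Drift and dW-diffusion of (32) with u1* from (45) substituted directly."""
    c = closed_loop_coefficients(p, A, C, B1, B2, D1, D2, N1)
    u1 = -(c["s1_tilde"] * x_hat + c["s_tilde"] * u2_hat + B1 * phi_hat) / c["n_tilde"]
    drift = A * x + B1 * u1 + B2 * u2
    diffusion = C * x + D1 * u1 + D2 * u2
    return drift, diffusion


def coefficients_closed_loop(x, x_hat, u2, u2_hat, phi_hat, p, A, C, B1, B2, D1, D2, N1):
    """Drift and dW-diffusion written in the closed-loop form (53)."""
    c = closed_loop_coefficients(p, A, C, B1, B2, D1, D2, N1)
    drift = (A * x + (c["A_tilde"] - A) * x_hat + c["S4_tilde"] * phi_hat
             + B2 * u2 + (c["B2_tilde"] - B2) * u2_hat)
    diffusion = (C * x + c["S5_tilde"] * x_hat + c["B1_tilde"] * phi_hat
                 + D2 * u2 + c["S6_tilde"] * u2_hat)
    return drift, diffusion


if __name__ == "__main__":
    args = dict(x=1.1, x_hat=0.9, u2=0.4, u2_hat=0.35, phi_hat=-0.25,
                p=0.7, A=-0.4, C=0.3, B1=1.2, B2=0.5, D1=0.6, D2=-0.2, N1=1.5)
    print(coefficients_direct(**args))
    print(coefficients_closed_loop(**args))
```

Both functions should return the same pair up to rounding, which is a quick way to catch a sign or transpose slip when the definitions of $\tilde A$, $\tilde B_2$, $\tilde S_4$, $\tilde S_5$, $\tilde S_6$ and $\tilde B_1$ are transcribed.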
The leader would like to choose an $\mathcal F_t$-adapted optimal control $u_2^*(\cdot)$ such that his cost functional

$$
J_2(u_2(\cdot)) = \frac12\mathbb E\Big[\int_0^T\big(\langle Q_2(t)x^{u_2}(t),x^{u_2}(t)\rangle + \langle N_2(t)u_2(t),u_2(t)\rangle\big)dt + \langle G_2x^{u_2}(T),x^{u_2}(T)\rangle\Big]\tag{54}
$$

is minimized, where $Q_2(\cdot)\in\mathcal S^n$, $N_2(\cdot)\in\mathcal S^{m_2}$ are deterministic, bounded matrix-valued functions, and $G_2\in\mathcal S^n$. We introduce the following assumption.

(A3.3) $Q_2(t)\ge 0$, for all $t\in[0,T]$ and $G_2\ge 0$.

The admissible control set $\mathcal U_2$ of the leader is defined as (3) in Section 1.2, for $\mathcal G_{2,t} = \mathcal F_t$ in this section.


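As mentioned in Remark 3.1, the leader has to include the process pair $(\varphi(\cdot),\beta(\cdot))$ as part of his new state processes, since $\hat\varphi(\cdot)$ is involved in the coefficients of (53). Thus for any $u_2(\cdot)$, the state equation of the leader is

$$
\left\{\begin{aligned}
dx^{u_2}(t) &= [A(t)x^{u_2}(t) + (\tilde A(t) - A(t))\hat x^{u_2}(t) + \tilde S_4(t)\hat\varphi(t) + B_2(t)u_2(t)\\
&\qquad + (\tilde B_2(t) - B_2(t))\hat u_2(t)]dt\\
&\quad + [C(t)x^{u_2}(t) + \tilde S_5(t)\hat x^{u_2}(t) + \tilde B_1^\top(t)\hat\varphi(t) + D_2(t)u_2(t)\\
&\qquad + \tilde S_6(t)\hat u_2(t)]dW(t) + \bar C(t)d\bar W(t),\\
-d\varphi(t) &= [A^\top(t)\varphi(t) + (\tilde A(t) - A(t))^\top\hat\varphi(t) + \bar S_1(t)x^{u_2}(t) - \bar S_1(t)\hat x^{u_2}(t)\\
&\qquad + \tilde S_2(t)u_2(t) + (\tilde S_3(t) - \tilde S_2(t))\hat u_2(t)]dt - \beta(t)d\bar W(t),\\
x^{u_2}(0) &= x_0,\quad \varphi(T) = 0,
\end{aligned}\right.\tag{55}
$$

which is a conditional mean-field FBSDE. Its solvability for an $\mathcal F_t$-adapted solution $(x^{u_2}(\cdot),\varphi(\cdot),\beta(\cdot))$ can be easily guaranteed (since the solvability of the FBSDFE (52) for $(\hat x^{u_2}(\cdot),\hat\varphi(\cdot),\hat\beta(\cdot))$ has been obtained). And we will denote the above as LQ Problem of the leader.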


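Since the process triple $(q(\cdot),k(\cdot),\bar k(\cdot))$ has been replaced by the new pair $(\varphi(\cdot),\beta(\cdot))$, we define the Hamiltonian function $H_2$ as

$$
\begin{aligned}
H_2(t,x^{u_2},u_2,\varphi,\beta;y,z,\bar z,p) &= \langle y, A(t)x^{u_2} + (\tilde A(t) - A(t))\hat x^{u_2} + \tilde S_4(t)\hat\varphi + B_2(t)u_2 + (\tilde B_2(t) - B_2(t))\hat u_2\rangle\\
&\quad + \langle z, C(t)x^{u_2} + \tilde S_5(t)\hat x^{u_2} + \tilde B_1^\top(t)\hat\varphi + D_2(t)u_2 + \tilde S_6(t)\hat u_2\rangle + \langle\bar z,\bar C(t)\rangle\\
&\quad + \langle p, A^\top(t)\varphi + (\tilde A(t) - A(t))^\top\hat\varphi + \bar S_1(t)x^{u_2} - \bar S_1(t)\hat x^{u_2} + \tilde S_2(t)u_2 + (\tilde S_3(t) - \tilde S_2(t))\hat u_2\rangle\\
&\quad + \frac12\big[\langle Q_2(t)x^{u_2},x^{u_2}\rangle + \langle N_2(t)u_2,u_2\rangle\big].
\end{aligned}\tag{56}
$$

And the adjoint equation (17) writes

$$
\left\{\begin{aligned}
dp(t) &= [A(t)p(t) + (\tilde A(t) - A(t))\hat p(t) + \tilde S_4^\top(t)\hat y(t) + \tilde B_1(t)\hat z(t)]dt,\\
-dy(t) &= [A^\top(t)y(t) + (\tilde A(t) - A(t))^\top\hat y(t) + C^\top(t)z(t) + \tilde S_5^\top(t)\hat z(t)\\
&\qquad + \bar S_1(t)p(t) - \bar S_1(t)\hat p(t) + Q_2(t)x^*(t)]dt - z(t)dW(t) - \bar z(t)d\bar W(t),\\
p(0) &= 0,\quad y(T) = G_2x^*(T),
\end{aligned}\right.\tag{57}
$$

with

$$
\hat p(t) := \mathbb E[p(t)|\mathcal F_t^Y],\quad \hat y(t) := \mathbb E[y(t)|\mathcal F_t^Y],\quad \hat z(t) := \mathbb E[z(t)|\mathcal F_t^Y],
$$

and $(p(\cdot),y(\cdot),z(\cdot),\bar z(\cdot))\in\mathbb R^n\times\mathbb R^n\times\mathbb R^n\times\mathbb R^n$ being $\mathcal F_t$-adapted processes. Note that (57) is an FBSDE.

We apply the maximum principle approach in Section 2.2 to solve LQ Problem of the leader, which now is a complete information one since $\mathcal G_{2,t} = \mathcal F_t$. We split this process into three steps.

Step 1. (Optimal control)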
Suppose that there exists an $\mathcal F_t$-adapted optimal control $u_2^*(\cdot)\in\mathcal U_2$ for the leader, and the corresponding optimal state $(x^*(\cdot),\varphi^*(\cdot),\beta^*(\cdot)) \equiv (x^{u_2^*}(\cdot),\varphi^*(\cdot),\beta^*(\cdot))$ is the solution to (55), that is,

$$
\left\{\begin{aligned}
dx^*(t) &= [A(t)x^*(t) + (\tilde A(t) - A(t))\hat x^*(t) + \tilde S_4(t)\hat\varphi^*(t) + B_2(t)u_2^*(t)\\
&\qquad + (\tilde B_2(t) - B_2(t))\hat u_2^*(t)]dt\\
&\quad + [C(t)x^*(t) + \tilde S_5(t)\hat x^*(t) + \tilde B_1^\top(t)\hat\varphi^*(t) + D_2(t)u_2^*(t)\\
&\qquad + \tilde S_6(t)\hat u_2^*(t)]dW(t) + \bar C(t)d\bar W(t),\\
-d\varphi^*(t) &= [A^\top(t)\varphi^*(t) + (\tilde A(t) - A(t))^\top\hat\varphi^*(t) + \bar S_1(t)x^*(t) - \bar S_1(t)\hat x^*(t)\\
&\qquad + \tilde S_2(t)u_2^*(t) + (\tilde S_3(t) - \tilde S_2(t))\hat u_2^*(t)]dt - \beta^*(t)d\bar W(t),\\
x^*(0) &= x_0,\quad \varphi^*(T) = 0.
\end{aligned}\right.\tag{58}
$$

Then (56) together with Proposition 2.3 yields that

$$
\begin{aligned}
0 &= N_2(t)u_2^*(t) + \tilde S_2^\top(t)p(t) + (\tilde S_3(t) - \tilde S_2(t))^\top\hat p(t)\\
&\quad + B_2^\top(t)y(t) + (\tilde B_2(t) - B_2(t))^\top\hat y(t)\\
&\quad + D_2^\top(t)z(t) + \tilde S_6^\top(t)\hat z(t),\quad\text{a.e. } t\in[0,T].
\end{aligned}\tag{59}
$$

Moreover, we can easily check that the convexity condition is satisfied. Then by Proposition 2.4, $u_2^*(\cdot)$ defined by (59) is really optimal.

Step 2. (State feedback representation)

The representation of $u_2^*(\cdot)$ through (59) is not satisfactory. We expect to obtain its state feedback representation via some Riccati equations. For this target, let us regard $(x^*(\cdot),p(\cdot))^\top$ as the optimal state and put

and (suppressing some $t$ below)

Then (58) with (57) is equivalent to the following conditional mean-field FBSDE

dX{t) = [AiX(t) + A 2 X(t) -b Bi$(t) 

+ BiZ{t) + B 2 U* 2 {t) + B 2 U* 2 {t)\dt 

-\- [Ci^(t) -|- C2^(t) -l- B^ ‘h(t) 

-b V 2 U *2 {t) + V 2 U *2 (t)] dW(t) -b CdW(t) , 

< -d$(t) = [Q 2 X(t) -b Q 2 X(t) -b Aj $(t) 

+ A^ l>(t) + Cj Z{t) + Cj Z{t) 

+ B 3 U 2 (t) + B 3 U 2 (t)] dt 
- Z(t)dW{t) - Z{t)dW(t), 

^ XiO) = Xo, $(T) = ^2^(T), 

(61) 

where 

$$
\hat X(t) := \mathbb E[X(t)|\mathcal F_t^Y],\quad \hat\Phi(t) := \mathbb E[\Phi(t)|\mathcal F_t^Y],\quad \hat Z(t) := \mathbb E[Z(t)|\mathcal F_t^Y].
$$

Supposing that

(A3.4) $N_2^{-1}(t)$ exists, for all $t\in[0,T]$,

then from (59), for a.e. $t\in[0,T]$, we have


^(t) = [5lX(t) + BjX(t) + Bjjit) 

+ Bjm+VjZit)+VjZit)], 


(63) 


and 


'4it) = -N2^[{B3+B3yx{t) + {B 2 +B 2 yMt) 


+ {V2 + V 2 ) Zit)]. 

Putting (63), (64) into (61), and letting 


(64) 


' := A - B 2 N^^BJ, 

A 2 := A 2 - B 2 N 2 -^BI - B2N^^B3 + ^ 3 )^, 

Bi := -B 2 N^^B^, B 2 := -B 2 N^^V^, 

Bi := Bi - B2N-^BJ - B2N-\B2 + ^ 2 )^, 

B 2 := Bi - B2N^^VJ - B2N^^ {V 2 + V 2 Y, 

Cl := Cl - V2N-^BJ, V 2 := -V2N-^Vj, 

C 2 := C2 - V2N^^BJ - V2N^\B3 + B^y, 

V 2 := -V2N-^VJ - V2N-^ {V 2 + V 2 Y, 

Q 2 ■■= Q 2 - b^n^^bJ, 

. Q 2 ■■= Q 2 - B^N^^BJ - BsN^^ (S 3 + Bsf, 
we get 

dX{t) = [AiX{t) + A2Xit) + Si$(f) 

+ Ii$(t) + B 2 Z{t) + B 2 Z{t)] dt 

+ [CiX{t)+C2X{t) + B2<i>{t) 

+ 61 $(t) + V2Z{t) + V2Z{t)] dW{t) 

' +CdW(t), 

-d$(t) = [Q2Xit) + Q2Xit)+Al^t) 

+ A 2 $(t) + Ci^ Z{t) + cj Z{t)]dt 

- Z{t)dW{t) - Z{t)dW{t), 

^ x(o) = Xo, $(r) = e2X(T). 

(65) 

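Up to now, we have every reason to suppose that

$$
\Phi(t) = \mathcal P_1(t)X(t) + \mathcal P_2(t)\hat X(t) + \mathcal P_3(t),\tag{66}
$$

due to the terminal condition in (65), where $\mathcal P_1(\cdot)$, $\mathcal P_2(\cdot)$ are both differentiable, deterministic $\mathcal S^{2n}$-valued functions with $\mathcal P_1(T) = \mathcal G_2$, $\mathcal P_2(T) = 0$, and the $\mathcal F_t^{W,Y}$-adapted process $\mathcal P_3(\cdot)$ satisfies the BSDE

$$
\left\{\begin{aligned}
-d\mathcal P_3(t) &= \Lambda(t)dt - \mathcal Q_3(t)dY(t),\\
\mathcal P_3(T) &= 0.
\end{aligned}\right.\tag{67}
$$

In the above, $\Lambda(\cdot)\in\mathbb R^{2n}$ and $\mathcal Q_3(\cdot)\in\mathbb R^{2n}$ are $\mathcal F_t^{W,Y}$-adapted processes, which will be determined later.

Our remaining task is to decouple the conditional mean-field FBSDE (65), by (66) and (67). This will lead to a derivation of our system of Riccati equations for $\mathcal P_i(\cdot)$, $i = 1,2$.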


First, by (65) and (21), we have 


f dX{t) = 


< 


I ^(0) = 

Noting that 


AiX(t) + A2X{t) + Bi>^{t) 


+ Bi^{t) + B2Z{t) + B2Z{t) 



+ [CiX{t)+C 2 X{t)+B 2 ^{t) 

+ i[$(t) -b V 2 Z{t) + V 2 Z{t)] dw{t) 

-|- CdY [t), 

Xo. 


X{t-,P) :=E[X{t)\Fn = 


E [Z{t)X{t)\F^ 

E[Zit)\Fr] 

■Y-l 


= X{t) :=E[X(t)|J-,'^], etc.. 


then applying Lemma 5.4 of [24] to (68), we get 


[ dX{t) = [(A + A 2 )Xit) + (Si + Si)$(t) 
+ (B 2 + B 2 )Z{t) 

I 1(0) = Xo. 

Also, by the backward equation of (65), we have 


dt 4- CdY{t), 

(69) 


-d$(t) = [Q2X(t) + Q2X(t) + A, $(t) 

+ aJ l>(t) + cj z{t) + cl Z{t) 


hA {t)x{t) 0 

0 0 


Z{t)\dt 


(70) 


- Z{t)dW{t) - Z{t)dY{t), 

[ $(T) = ^2A(T). 
Applying Itô's formula to (66), by (68), (69) and (70), we obtain


of (72), and supposing that 


=V^{t)X{t)dt + Vi{t)dX{t) 

+ V 2 {t)X(t)dt + V 2 (t)dX{t) + dV:i{t) 
--[[Vi+ViAi+ViBiVi]X{t) 

+ \p2 + ^2 (-^1 + > 42 ) + 7^2 {Bi + 

+ Pi (Si + Si)P2 + P 2 (Si + Si)P2 
+ Pi: 42 + PiSiPi]l(t) + ViB2Z{t) 

+ [PiS2+P2(S2+S2)]i(t) 


-PiSi 


P3(i)-Pi( 


C 0 


+ T^iBiP^it) — P 2 ^C 0 
+ P2(Si+Si)P3(t)- 


0 

hA {t)x{t) 


dt 


+ [[ViCi+ViBlVi]X{t) + [P 1 C 2 


+ PiSj V 2 + Pi Si (Pi + P 2 )] X{t) 

+ PiSjP3(t) + Pis[p3(7) + VAD2Z{t) 




+ ■ 


ViV^Z{t)^dW{t) 

+ [{Vi+V2)C+QAt)]dY{t) 

- {[Q 2 +:4[pi] x(t) + [I 2 +:4jpi 
+ -4i P 2 + -42 ”^ 2 ] X (t) + Z (t) 

—T ', , ( hJ{t)x{t) 0 

V 0 0 

+ -4i P3(7) + -42 P3(t) j-dt 
+ Z{t)dW{t) + Z{t)dY{t). 


m 


(71) 


Comparing the coefficients of {■}dW(t) and {•}dY(t) on 
both sides of the last “ = ” of (71), we have 


Z{t) = [PiCi +PiSjPi]X(t) + [P 1 C 2 

+ PiS 2 ^ P 2 + vX (T’l + P 2 ) ] ^ {t) ^^ 2 ) 

+ T’iT> 2 Z (t) + 'PiD 2 Z{t) 

+ PiSjP3(t)+Pis[p3(7), 

and 

$$
\bar Z(t) = (\mathcal P_1 + \mathcal P_2)\bar{\mathcal C} + \mathcal Q_3(t).\tag{73}
$$

Taking the conditional expectation $\mathbb E[\,\cdot\,|\mathcal F_t^Y]$ on both sides


(A3.5) := [hn - Vi{V2 + V2{t))] ^ 

■■= [l2n+Vl{V2+V2)N^\V2+V2YY^ 

exists, 


we have 


Z{t) = J^Y{ [Pi (Cl + C 2 ) + Pi (S 2 + Si)^ 

X (Pl+P2)]A(t)+Pi(S2+Si)^P3(f)}. 

(74) 

Inserting (74) into (72), and supposing that 

(A3.6) X'Y ■■= [hn - P 1 V 2 ] 

:= [hn + PiP 2 fV 2 "^Pj] ^ exists, 


we have 


Zit) = [PiCi + PiSjPi] A(t) 

+ [P 1 C 2 + PiSj P 2 + Pis[ (Pi + P 2 ) 
+ Pip27^2”M7'i(Ci+C2) 

+ Pl(S2+Si)^(Pl+P2)]]A(t) 

+ PiSjP3(t) + [Pis[ + P 1 P 2 

xA4-iPi(S2+Si)^]P3(7)}- 


(75) 


Comparing the coefficients of $\{\cdot\}dt$ in (71) and substituting (73), (74), (75) into it, we obtain the following coupled system of Riccati equations


0 — Pi -l- PiAi -f Ai Pi -l- PiSiPi -l- Q 2 

+ (cj + riB2)JfYPi{cJ + V1B2Y, 

0 = P 2 + P 2 (-4i -b A 2 ) Y (Ai -f- A 2 ) P 2 
-bP2(Si -bSi)Pi -bPi(Si -bSi)P2 
+ P 2 (Si Y Si)P 2 -f- Pi A 2 -b A 2 Pi 
< + PiSiPi YQ 2 Y {Cj Y P 1 S 2 ) (76) 

X 7^^ Vi [C 2 + Si Pi + (S 2 + Si)^P 2 ] 

-b [(Ci^ -bPlS2)7V’^VlP2 -bCj -bPlS2 
+ P 2 (S 2 + S 2 )].A' 2 ”' [T’l (Cl + C 2 ) 

+ Pl(S2+Si)^(Pl+P2)], 

.Pi(t) = 02, P2(r) = o, 
and


A(i) =[aI; +7^1^! + (C7 + ViB2W^ 

+ \a^ +ViBi+P 2 {Bi+Bi) 

+ {Cj + VlB 2 )Jf 2 ^ [Pli[ + V 1 V 2 

xM^^Vi ^2 + Bi)^]Mt) 

h7 it)x{t) 0 \ 

Qsit) 

0 0 / 

hJ{t){x{t) — x{t)) 0 


+ 


V 2 C. 


Applying Lemma 5.4 in Xiong [24] to (67), we have 


(77) 


r -dA{t) = { 


Ai + A2 + (Vi + P2) (Bi + Bi) 


+ {Cj + ViB2)M2 ^ [bI + ViB^ 

Pi' 

hJ{t)x{t) 0 


+ ViV2M2^Vi{B 2 + B^)^]]Mt) 


\ 0 0 

- Q 3 {t)dY{t), 


Qsit) }dt 


[ V3{T) = 0. 

(78) 

Note that the above filtered BSDE (78) satisfies the stochastic Lipschitz condition in Wang et al. [22], so it admits a unique $\mathcal F_t^Y$-adapted solution $(\hat{\mathcal P}_3(\cdot),\hat{\mathcal Q}_3(\cdot))$. Thus the BSDE (67) admits a unique $\mathcal F_t^{W,Y}$-adapted solution $(\mathcal P_3(\cdot),\mathcal Q_3(\cdot))$.


Note that the system of Riccati equations (76) is entirely new. For its solvability, we first rewrite the first equation for $\mathcal P_1(\cdot)$ as follows


'0 = A+Vi [Ai-B2N2^BJ] 

+ [Al - B2N2^BJ]'^Vi - ViB2N2^BjVi 
< + {Cj - riB2N^^Vj) [hn + ViV2N2^VJ] 

X Vi {Cj - ViB2N2^vJY + Q 2 , 

Vi{T) = g 2 . 

(79) 

Note that 


equation theory, (79) admits a unique $\mathcal S^{2n}$-valued solution. However, the solvability of the second equation for $\mathcal P_2(\cdot)$ is widely open. In this paper, we only impose the solvability of it as an assumption.
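The symmetry claim used for (79) rests on the identity $(I+\mathcal P M)^{-1}\mathcal P = \mathcal P(I+M\mathcal P)^{-1}$ for symmetric $\mathcal P$ and symmetric $M$. The following is a minimal numerical sketch of that identity, assuming $2\times 2$ matrices with arbitrary illustrative entries and $M = D N^{-1} D^\top$; the helper names and the test matrices are assumptions made for the example, not objects from the paper.

```python
def mat_mul(a, b):
    """Product of two small dense matrices stored as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def mat_add(a, b):
    return [[a[i][j] + b[i][j] for j in range(len(a[0]))] for i in range(len(a))]


def transpose(a):
    return [list(row) for row in zip(*a)]


def inv2(m):
    """Inverse of a 2x2 matrix via the adjugate formula."""
    (a, b), (c, d) = m
    det = a * d - b * c
    if det == 0.0:
        raise ValueError("matrix is singular")
    return [[d / det, -b / det], [-c / det, a / det]]


def push_through_sides(P, D, N):
    """Return both sides of (I + P M)^{-1} P = P (I + M P)^{-1}, M = D N^{-1} D^T."""
    identity = [[1.0, 0.0], [0.0, 1.0]]
    M = mat_mul(mat_mul(D, inv2(N)), transpose(D))
    left = mat_mul(inv2(mat_add(identity, mat_mul(P, M))), P)
    right = mat_mul(P, inv2(mat_add(identity, mat_mul(M, P))))
    return left, right


if __name__ == "__main__":
    P = [[2.0, 0.5], [0.5, 1.0]]      # symmetric, illustrative values
    D = [[1.0, 0.2], [-0.3, 0.8]]
    N = [[1.5, 0.1], [0.1, 2.0]]      # symmetric and invertible
    left, right = push_through_sides(P, D, N)
    print("left  =", left)
    print("right =", right)
```

Since the right-hand side is the transpose of the left-hand side when $\mathcal P$ and $M$ are symmetric, equality of the two sides is the same statement as symmetry of either one.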

Putting (66), (74), (75) into (63), we obtain that 

um = [Hg^ + BjVi + 

X {Ci + Bjvi)]x{t) + Y,{ri,r 2 )xit) (so) 

+ E2(iPl)iP3(t) + [Bj +VjJ72"VlBj]V3{t)}, 


for a.e. $t\in[0,T]$, where


' Si {Vi,V 2 ) ■■= Bj + V 2 B^{Vi + V 2 ) 

-\- vjJ\f2 ^ (ViC2 + V 1 B 2 V 2 + ViBi {Vi -f V 2 ) 
+ VlV2N2^[Vl{Ci+C2) 

+ Vi{B 2 + BiY {V 1 +V 2 )]) 

' +T)lM2^[Vl{C3+C2) 

+ Vi{B 2 + Biy {V 1 +V 2 )], 

S 2 {Vi) := [v3i + V 1 V 2 

x7^2-Vi(H2+Ii)^] 

+ VlM 2 ^Vi(B 2 +BiY. 


Step 3. (Optimal state equation) 


By (66), (73), (74), (75), we have decoupled the optimality system (65). And the optimal “state” $X(\cdot) = (x^*(\cdot),p(\cdot))^\top$ of the leader should be the $\mathcal F_t^{W,Y}$-adapted solution to the conditional mean-field SDE


' dX{t) ={Y3{Vi)Xit) + U{Vi,V 2 )Xit) 

+ Y 5 {Vi)V 3 {t) + Ye{Vi)V 3 {t)}dt 
+ {Yr{Vi)X{t) + Ys{Vi,V 2 )X{t) 

+ S9(7^i)7^3(t) -I- YiQ{Vi)V3{t)'^dW{t) 

+ CdW{t), 

. A(0) = Xo, 


where $\hat X(\cdot)$ satisfies the filtered SDE


(81) 


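$$
[I_{2n} + \mathcal P_1\mathcal D_2N_2^{-1}\mathcal D_2^\top]^{-1}\mathcal P_1 = \mathcal P_1[I_{2n} + \mathcal D_2N_2^{-1}\mathcal D_2^\top\mathcal P_1]^{-1}
$$

is symmetric (which can be proved by multiplying both sides by $[I_{2n} + \mathcal P_1\mathcal D_2N_2^{-1}\mathcal D_2^\top]$ from left and by $[I_{2n} + \mathcal D_2N_2^{-1}\mathcal D_2^\top\mathcal P_1]$ from right). Then by standard Riccati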


r dX{t) ={Yii{Vi,V 2 )X{t) + Yi 2 {Vi)A{t)}dt 
\ +CdW{t), 

[ X(0) = Ao, 

(82) 
with


' S 3 ( 7 ^i) := ^1 + BiVi + B2Jf2 ' [PiCi + ViBIVi] , 
^4(7^!)7^2) •= -^2 + B1P2 + Bi{T ^i + T’2) 

+ B2Jf2^ \V1C2 + PiBJ r2 + (iPl + V2) 

+ VlV2Kf2^ [Pi (Cl + C2) + iPl (^2 + Si) ^ 

X (T’l +^2)]] +S 27 ^ 2 ”M^i(Ci +C2) 

+ iPl(S 2 +Si)^(iPl+iP 2 )], 

£5(7^1) :=Si+S 2:^2 ViSj, 

S 6 (^i) := Si + S 2:^2 ' [^ 1 ? + ^ 1^2 

X N^^Vi([B2 + Si)^] + B2N^^Vi[B2 + Si)^, 

S 7 (iPi) := Cl + sJiPi + ©2:^2"' [^iCi + PiBIPi] , 
^ 8 (^ 1 , P2) := C2 + sJiP2 + s[ (iPi + P2) 

+ V2Jf[^ [P1C2 + PiB[P2 + Pi Si (Pi + P2) 

+ Plp 2 ^ 2 ”'[^l(Cl+C 2 ) 

+ Pl(S 2 +Si)^(Pl+P 2 )]] 

+ V2N2 ^ [Pi (Ci + C2) 

+ Pl(S 2 +Si)^(Pl+P 2 )], 

E 9 (Pi) :=sJ +V2JI[^PiBI, 

EioCPi) := Si + ^2:^2'' [Pi^i + Pi^2 

X M2^Vi(B2 + Si)^] + V2J^2^Pi{B2 + Si)^, 

Sii(Pi,P2) := A1+A2+ (Si + Si) (Pi + P2) 

+ S 2 :^^^[PiCi+PiSjPi] 

+ B 2 M[^ [P1C2 + Pisj P2 + PiSi (Pi + P2) 

+ Plp 2 ^ 2 ”'[^l(Cl+C 2 ) 

+ Pl(S 2 +Si)^(Pl+P 2 )]] 

+ S 27 ^ 2 ”'[^i(Ci+C 2 ) 

+ Pl(S 2 +Si)^(Pl+P 2 )], 

Si 2 (Pi) :=Si +Si +S 2 :^^ViSj 
+ ^2:^2"^ [Pi Si +P1P2 

X 7^2"^Pi(^ 2 + Si)^] + S2A/'2 -^Pi(S 2 + Si)^. 

We summarize the above in the following theorem. 

Theorem 3.2 Suppose that assumptions (A3.1)∼(A3.6) hold and the system of Riccati equations (76) admits a differentiable solution pair $(\mathcal P_1(\cdot),\mathcal P_2(\cdot))$. Let $\hat X(\cdot)$ be the $\mathcal F_t^Y$-adapted solution to (82), and $X(\cdot)$ be the $\mathcal F_t$-adapted solution to (81). Define $(\Phi(\cdot),Z(\cdot),\bar Z(\cdot))$ by (66), (75), (73), respectively. Then (65) holds, where $(\mathcal P_3(\cdot),\mathcal Q_3(\cdot))$ is the unique $\mathcal F_t^{W,Y}$-adapted solution to (67). Moreover, the state feedback control $u_2^*(\cdot)$ defined by (80) is an optimal control for LQ Problem of the leader.

Note that the optimal control $u_2^*(\cdot)$ for the leader given by (80) is nonanticipating. Likewise, for the follower, the optimal control $u_1^*(t) = u_1^*(t;\hat x^*(t),\hat u_2^*(t),\hat\varphi^*(t),\hat\beta^*(t))$ can also be represented in a nonanticipating way. In fact, by (45), noting (64), (66), (74) and (73), we have

ul{t) = -iVf^ [Sjx*{t) + Siilit) 

+ Bj^*{t)+Djl3*it)] 

= - 0 ) x{t) + Su* 2 it) 

+ (0 Bj ) $(t) + (0 Dj ) Z{t) 

= - iVi-i [ ( 0 ) - 8 X 2 ^ [Bj + S2^ Pi 

+ P2^^2"Vi(Ci +SjPi) +Ei(Pi,P2)] 

+ (0 Si^) (Pi+P 2 )]A(i) 

- iVi"! [ (^ 0 [l]2(Pi) 

+ S2^ +P2^^2’'PlSj]]P3(t) 

- iVfi ( 0 57 ) [(Pi + P2)C + Qsit )], 

for a.e. $t\in[0,T]$, which is an observable state feedback representation for the optimal control of the follower, where $\hat X(\cdot)$ satisfies (82) and $(\hat{\mathcal P}_3(\cdot),\hat{\mathcal Q}_3(\cdot))$ satisfies (78).

Up to now, we have solved our LQ leader-follower stochastic differential game with asymmetric information, and it admits an open-loop Stackelberg equilibrium $(u_1^*(\cdot),u_2^*(\cdot))$. Its state feedback representation is (83) and (80), respectively. And the corresponding optimal state equation of the leader is (81), while the corresponding optimal state (observable) equation of the follower is (82).

Remark 3.4 When we consider the complete information case, i.e., $\bar W(\cdot)$ disappears and $\mathcal G_{1,t} = \mathcal F_t$, Theorems 3.1 and 3.2 coincide with Theorems 2.3 and 3.3 in [26].

4 Concluding Remarks 

In this paper we have discussed a leader-follower stochastic differential game with asymmetric information, also called a stochastic Stackelberg differential game with asymmetric information. This kind of game problem possesses several attractive features worth highlighting. First, the game problem has the Stackelberg or leader-follower feature, which means the two players play different roles during the game. Thus the usual approach to game problems, as in Yong [25], Hamadene [7], Wu [23], An and Øksendal [1], Wang and Yu [21], Yu [29], Hui and Xiao [10,11], Shi [17], where the two players play equivalent roles, does not apply. Second, the game problem has asymmetric information between the two players, which was not considered in Yong [26], Øksendal et al. [15] and Bensoussan et al. [5]. In detail, the information available to the follower is based on some sub-σ-algebra of that available to the leader. The stochastic filtering technique is introduced to compute the optimal filtering estimates for the corresponding adjoint processes, which act as the solution to some FBSDFE. Third, the open-loop Stackelberg equilibrium is represented in its state feedback form for the partial observation case of the LQ problem, under some appropriate assumptions on the coefficient matrices in the state equation and the cost functionals. Some new conditional mean-field FBSDEs and a system of Riccati equations are first introduced in this paper, to deal with the leader's LQ problem.

Note that, in principle, Theorems 3.1 and 3.2 provide a useful tool to seek the open-loop Stackelberg equilibrium. As a first step in this direction, we apply our results to LQ models to obtain explicit solutions. We hope to return to the more general case, when the states are not linear or the costs are not quadratic, in our future research to completely solve the problems in the motivating examples introduced in the introduction. We expect that explicit solutions will not be available for these examples and numerical approximations will be studied. It is worthwhile to study the closed-loop Stackelberg equilibrium for our problem, as well as the solvability of the system of Riccati equations (76). In addition, many more partially observable cases, which are more important and reasonable for applications and technically demanding in their filtering procedures, are highly desirable for further research. These challenging topics will be considered in our future work.